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Plant. GntI sequences and the use thereof for the production 
of plants having reduced or lacking N-acetyl glucosaminyl 
transferase I (GnTI) activity 

The present invention relates to plant GnTI sequences, in 
particular, plant nucleic acid sequences encoding the enzyme 
N-acetyl glucosaminyl transferase I (GnTI), as well as GntI 
antisense or sense constructs, deduced therefrom, and their 
translation products, antibodies directed against said trans- 
lation products as well as the use of the sequence informa- 
tion for the production of transformed microorganisms and of 
transgenic plants, including those with reduced or lacking 
N-acetyl glucosaminyl transferase I activity. Such plants 
with reduced or lacking N-acetyl glucosaminyl transferase I 
activity are of great importance for the production of glyco- 
proteins of specific constitution with respect to their sugar 
residues. 

Prior art: 

In eukaryotes, glycoproteins are cotranslationally assembled 
in the endoplasmatic reticulum (ER) (i.e. during import into 
the ER lumen) by the attachment of initially membrane bound 
glycans (via dolichol pyrophosphate) to specific asparagine 
residues in the growing polypeptide chain (N-glycosylation) . 
In higher organisms, sugar units located at the surface of the 
folded polypeptide chain are subjected to further trimming and 
modification reactions (ref. 1) in the Golgi cisternae. 
Initially, typical basic Glc 3 Man 9 GlcNAc2 units of the high- 
mannose type are formed by means of different glycosidases and 
glycosyl transferases in the ER, which during the passage 
through the different Golgi cisternae are subsequently 
converted to so-called complex glycans. The latter are 
characterized by less mannose units and the presence of 
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additional sugar residues, such as fucose, galactose and/or 
xylose in plants and sialic acid (N-acetyl neuraminic acid, 
NeuNAc) in mammals (ref. 1,2,3).. The extent of the modifica- 
tions can differ between glycoproteins. Single polypetide 
chains may carry heterogeneous sugar chains. Furthermore, the 
glycosylation pattern may vary for a specific polypeptide 
(tissue specific differences) , and does not always have to be 
uniform with respect to a specific glycosylation site, which 
is referred to as microheterogeneity (ref. 4,5). Up to now, 
the role of asparagine bound glycans is barely understood, 
which i.a. results from the fact, that said glycans may serve 
several functions (ref. 6). However, it can be assumed, that 
e.g. protection of a polypeptide chain from proteolytic 
degradation can also be achieved by glycans of a more simple 
oligomannosyl type (ref. 7). 

Description of problems: 

Glycoproteins are highly important in medicine and research. 
However, large scale isolation of glycoproteins is time-consu- 
ming and expensive. The direct use of glycoproteins isolated 
conventionally often raises problems, since upon administration 
as a therapeutic, single residues of the glycan components may 
cause undesired side effects. In this context, the glycan com- 
ponent predominantly contributes to the physico-chemical 
properties (such as folding, stability and solubility) of the 
glycoproteins. Furthermore, isolated glycoproteins, as already 
mentioned above, rarely carry uniform sugar residues, which is 
referred to as microheterogeneity. 

For the production of glycoproteins for medicine and research, 
yeasts prove to be unsuitable, since they are only able to 
perform glycosylations of the so-called high-mannose type. 
While insects and higher plants exhibit complex glycoprotein 
modifications, these, however, differ from those of animals. 
Therefore, glycoproteins isolated from plants have a strong 
antigenic effect in mammals. In most cases, animal organisms 
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with glycosylation defects are not viable, since terminal 
glycan residues (e.g. of membraneous glycoproteins) mostly 
possess biological signal function and are indispensable for 
cell-cell-recognition during the course of embryogenesis. 
Mammalian cell lines with defined glycosylation defects 
already exist, the cultivation of which, however, is labour- 
intensive and expensive. 



For mammals, different glycosylation mutants have been 
described in detail at the cell culture level (ref. 
7,8,9,10). Said mutants are either defective in biosynthesis 
of mature oligosaccharide chains attached to dolichol pyro- 
phosphate or in glycan processing or show alterations in 
their terminal sugar residues, respectively. Some of these 
cell lines exhibit a conditional-lethal phenotype or are 
defective in intracellular protein transport. The consequen- 
ces of said defects for the intact organism are difficult to 
estimate. It has been observed, that a modification in the 
pattern of complex glycans on the cell surfaces of mammals is 
accompanied by the formation of tumours and metastases, 
although a functional relationship could not yet unambi- 
guously be demonstrated (ref. 9) . Therefore, in mammals gly- 
cosylation mutants are very rare. These defects, summarized 
under HEMPAS (Hereditary Erythroblastic Mult inucleari ty with 
a Positive Acidified Serum lysis test) (ref. 10,11), are 
based either on a deficiency in mannosidase II and/or low 
levels of the enzyme N-acetyl glucosaminyl transferase II 
(GnTII ) , and have strongly limiting effects on the viability 
of the mutated organism. GntI knock-out mice, in which the 
gene for GnTI has been destroyed, already die in utero of 
multiple developmental defects (personal comnunication, H. 
Schachter, Toronto) . 



Until recently, no comparable mutants were known for plants. 
By the use of an antiserum, which specifically recognizes 
complex modified glycan chains of plant glycoproteins and 
which predominantly is directed against the highly antigenic 
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pl-»2 linked xylose residues (ref. 12), the applicant was able 
to isolate several independant mutants from an EMS mutagenized 
F2 population of the genetic model plant Arabidopsis thaliana, 
which no longer showed complex glycoprotein modification 
(complex glycan, cgl mutants) . After at least five back- 
crosses, each followed by intermittent selfings (to screen for 
the recessive defects) , the glycoproteins were analyzed. These 
glycoproteins mainly carried glycans of the Man 5 GlcNAc 2 type, 
indicating a defect in N-acetyl glucosaminyl transferase I 
(GnTI) (ref. 8). Indeed, the Arabidopsis cgl mutants lacked 
GnTI activity (ref. 13), which normally catalyzes the first 
reaction in the synthetic pathway to complex modified sugar 
chains (ref. 1) . However, according to observations so far, 
the viability of the mutated plants is not affected. In recent 
publications, plants are suggested as a putative source for 
the production of pharmaceutically relevant glycoproteins or 
vaccines (ref. 14,15). There however, it was overlooked, that 
glycoproteins isolated from plants may cause severe immune 
reactions in mammals, which up to now obstructed the 
production of heterologous glycoproteins in cultivated plants. 

The applicant could demonstrate by way of example for the 
Arabidopsis cgl mutant, that plants can manage without complex 
modified glycoproteins to a great extent (ref. 13) . Initially, 
secretory proteins are normally glycosylated in the ER of the 
mutant. In the Golgi apparatus of the cgl mutant, however, the 
oligomannosyl chains linked to the polypetide backbone via 
asparagine residues (N-glycosylation) then remain at the stage 
of Man 5 GlcNAc 2 residues, since N-acetyl gluccsaminyl trans- 
ferase I (GnTI) activity is missing (Fig. 1) .. By this bio- 
synthesis block, the plant specific complex glycoprotein 
modification and in particular the attachment of ccl-»3 fucose 
and pi— >2 xylose residues is prevented, whereby the strong 
antigenic effect on the mammalian organism is absent. However, 
Arabidopsis as a herb only has little utilizable biomass. 
Therefore, for the large scale production of biotechnologi- 
cally relevant glycoproteins these cgl plants are less 
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suitable. As an alternative, cultivars, especially Solanaceae, 
such as potato, tobacco, tomato or pepper and furthermore 
alfalfa, canola, beets, soybean, lettuce, corn, rice and 
grain, with missing or highly reduced GnTI activity, would be 
ideal for the production of heterologous glycoproteins in 
plants. For this purpose, the methods of homology-dependent 
gene silencing would be applicable (ref. 16, 17). 

As Fig. 3 demonstrates, the homology of the first determined 
plant GntI sequence from potato (Solanum tuberosum L., St) is 
extraordinary low in comparison to the corresponding known 
sequences of animal organisms (only 30-40% identity at the 
protein level, cf . Fig. 3A) . Therefore, by the use of hetero- 
logous GntI gene sequences an efficient reduction of endo- 
genous complex glycoprotein modification in plants by means of 
antisense or sense suppression, respectively, (ref. 21), 
probably cannot be achieved. 

Thus, in medicine and research there is still the need for a 
cost-effective production in suitable organisms of recombinant 
glycoproteins with a minimum of uniform, i.e. defined sugar 
residues . 

Nature of the present invention: 

Since the applicant for the first time has been able to isolate 
and elucidate plant GntI cDNA sequences, it is now possible 
i.a. to obtain and, in particular, to generate any plant having 
reduced or missing GnTI activity, and to detect the correspon- 
ding mutants, respectively, by means of reverse genetic approa- 
ches following transposon (ref. 18) or T-DNA insertion (ref. 
19) , respectively, so as to produce glycoproteins with low 
antigenic potential in said mutants. 
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i) Enzymes 

Generally, the present invention comprises different N-acetyl 
glucosaminyl transferase I enzymes (EC 2.4.1.101) from plants, 
e.g. potato (Solanum tuberosum L.), tobacco (Nicotiana tabacum 
L.) and Arabidopsis thaliana (L.). In particular, the present 
invention relates to enzymes, which exhibit or contain the 
amino acid sequences given in Fig. 2 and 3B as well as in the 
accompanying sequence protocol. 

Further, the invention comprises enzymes, which are derived 
from amino acid sequences of the above mentioned enzymes by 
amino acid substitution, deletion, insertion, modification or 
by C-terminal and/or N-terminal truncation and/or extension, 
and which - if showing enzymatic activity - exhibit a speci- 
ficity comparable to that of the starting enzyme, i.e. N-ace- 
tyl glucosaminyl transferase I activity, and optionally a 
comparable activity . 

In the present context, by the term "comparable activity 1 ' an 
activity is understood, which is in the range of up to 100% 
above or below that of the starting enzyme. Accordingly, also 
comprised by the invention are derived enzymes or proteins 
with very low or completely lacking enzymatic activity, which 
is detectable by means of one or more of the tests mentioned 
as follows. The enzyme activity is determined by a standard 
assay, which is performed with microsomal fractions either 
under radioactive conditions, e.g. using UDP- [ 6- 3 H] GlcNAc as 
a substrate (ref. 13) or non-radioactive conditions (HPLC 
method; ref 20) . Plant GnTI activity can be detected on the 
subcellular level in Golgi fractions (ref. 21) . On account of 
low yields, however, it is almost impossible to enrich the 
enzyme from plants. 

Alternatively, an enzyme derived according to the present 
invention, may optionally be defined as an enzyme, for which 
a DNA sequence encoding the enzyme can be determined or 
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derived, which hybridizes to a DNA sequence encoding the 
starting enzyme or to a complementary sequence under strin- 
gent conditions, as defined as follows. 

For example, an enzyme derived in such a manner represents an 
isoform, which comprises the amino acids 7 4 to 44 6 of the amino 
acid sequence illustrated in Fig. 2 and in SEQ ID No:l and 2. 
This isoform i.a. lacks the membrane anchor formed by amino 
acids 10 to 29. As a result, this enzyme isoform may be located 
in the plant cytosol. 

As examples for C- and/or N-terminally extended proteins, 
fusion proteins can be mentioned, comprising in addition to 
an amino acid sequence according to the invention a further 
protein, which e.g. exhibits a different enzymatic activity 
or which may be easily detected in another manner, such as by 
means of fluorescence or phosphorescence or on account of a 
reactivity with specific antibodies or by binding to suitable 
affinity matrices. 

Furthermore, the invention comprises fragments cf said enzymes, 
which optionally no longer exhibit any enzymatic activity. 
Generally, these fragments show an antigenic effect in a host 
immunized with said fragments, and may accordingly be employed 
as an antigen for the production of monoclonal or polyclonal 
antibodies by immunization of a host with those fragments, 

Moreover, this invention also relates to N-acetyl glucosaminyl 
transferase I enzymes from other varieties and plant species, 
which are obtainable on account of hybridization of their 
genes or one or more regions of their genes: 

- to one or more of the DNA sequences and/or DNA fragments of 
the present invention, as discussed below and/or 

- to suitable hybridization probes according to the inven- 
tion, which may be prepared on the basis of the amino acid 
sequences mentioned in the sequence protocol considering the 
degeneration of the genetic code. 
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Further comprised by the invention in accordance with the 
above are enzymes or proteins derived from these N-acetyl 
glucosaminyl transferase I enzymes, including fusion proteins 
thereof, as well as fragments of all of these enzymes or pro- 
teins . 



ii) Antibodies 



Another aspect of the present invention relates to the use of 
the amino acid sequences mentioned above and of fragments 
thereof having antigenic effects, respectively, for the pro- 
duction of monoclonal or polyclonal antibodies or antisera by 
immunizing hosts with said amino acid sequences or fragments, 
respectively, as well as of antibodies or antisera, respec- 
tively, per se, which specifically recognize and bind to the 
enzymes and/or antigens described above. The general proce- 
dure and the corresponding techniques for the generation of 
polyclonal and monoclonal antibodies are all well-known to 
the persons skilled in the art. 



Exemplarily, by the use of a fragment of the GntI cDNA 
(nucleotides 275 to 1395) represented in Fig. 2 and SEQ ID 
NO: 1, the recombinant GnTI protein from Solanum tuberosum 
with 10 N-terminal histidine residues (His-tag) was overex- 
pressed in E . coli, and, following affinity purification via 
a metal-chelate matrix, was employed as an antigen for the 
production of polyclonal antisera in rabbits (cf. Examples 5 
and 6) . 



One possible use of the antibodies of the invention resides 
in the screening of plants for the presence of N-acetyl 
glucosaminyl transferase I. 



Binding of the antibody according to the present invention to 
plant protein (s) indicates the presence of N-acetyl glucosa- 
minyl transferase I enzyme detectable with said antibody. In 
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general, this antibody may then be covalentiy bound to a 
carrier in a later step, and optionally be employed for the 
enrichment or purification of the enzyme by means of column 
chromatography . 

On the other hand, a negative binding result using the anti- 
body of the present invention, i.e. lack of binding to the 
plant proteins, may suggest, that N-acetyl glucosaminyl 
transferase I enzyme is absent (or highly modified by muta- 
tion) , and thus, that N-acetyl glucosaminyl transferase I 
activity of a plant investigated is missing or highly reduced. 

Techniques for the realization of the screening assays men- 
tioned above or the enrichment or purif icatior. of enzymes by 
the use of antibody columns or other affinity matrices (cf. 
Examples 5 and 6) are well-known to those skilled in the art. 



iii) DNA sequences 



The present invention further comprises DNA sequences enco- 
ding the amino acid sequences of the invention, including 
amino acid sequences derived therefrom according to the above 
provisions. In particular, the invention relates to the 
respective gene, which is the basis of the amino acid sequen- 
ces described in the Figures 2 and 3B and the sequence proto- 
col, and especially, to the cDNA sequences described in Fig. 
2 and the sequence protocol, as well as to DNA sequences 
derived from these genes and DNA sequences. 

By the term "derived DNA sequences" are meant sequences, 
which are obtained by substitution, deletion and/or insertion 
of one or more and/or smaller groups of nucleotides of the 
sequences mentioned above and/or by truncation or extension 
at the 5' and/or 3' terminus. Modifications within the DNA 
sequence may lead to derived DNA sequences , which encode 
amino acid sequences being identical to the amino acid 
sequence encoded by the starting DNA sequence, or to such 



- 10 - 



sequences, in which, compared to the amino acid sequence, 
which is encoded by the starting DNA sequence, single or a 
few amino acids are altered, i.e. substituted, deleted and/or 
inserted, as well as to such sequences, which - optionally in 
addition - are truncated and/or extended at the C-terminus 
and/or N-terminus. 

Furthermore, the present invention also extends to the comple- 
mentary sequences of the genes and DNA sequences according to 
the invention, as well as the RNA transcription products 
thereof . 

Particularly comprised by the present invention are all 
sequences derived according to the above provisions, which 
over their entire length or only with one or more partial 
regions hybridize under stringent conditions to the starting 
sequences mentioned above or to the sequences complementary 
thereto or to parts thereof, as well as DNA sequences com- 
prising such sequences. 

By the term "hybridization under stringent conditions" in the 
sense of the present invention is understood a hybridization 
procedure according to one or more of the methods described 
below. Hybridizing: up to 20 h in PEG buffer according to 
Church and Gilbert (0.25 M Na 2 HP0 4 , ImM EDTA, 1% (w/v) BSA, 
7% (w/v) SDS, pH 7.5 with phosphoric acid; ref. 22) at 42°C 
or in standard hybridization buffers with formamide at 42 °C 
or without formamide at 68°C (ref. 23). Washing: 3 times at 
65°C for 30 min in 3x SSC buffer (ref. 23), 0.1% SDS. 

In the sense of the present application, the term 
"hybridization" always means hybridization under stringent 
conditions, as mentioned above, even if this is not explici- 
tely indicated in the individual case. 



Moreover, 
sequences 



the invention relates to fragments of the DNA 
mentioned above, including the DNA sequences 
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derived in accordance with the above provisions, to fragments 
derived from such fragments by nucleic acid substitution, 
insertion and/or deletion as well as the corresponding frag- 
ments with sequences complementary thereto. Such fragments 
are i.a. suitable as sequencing or PCR primers, screening 
probes and/or for uses as discussed below. For the use as a 
screening or hybridizing probe, the DNA fragments according 
to the present invention are frequently employed as radio- 
labelled fragments. Fragments carrying sequences, which are 
derived from the starting sequences defined above by substi- 
tution, deletion and/or insertion of one or more nucleotides, 
and the sequences complementary thereto, respectively, are 
comprised by the invention to that extent, as said fragments 
hybridize under the above mentioned stringent conditions to 
the starting sequences, or to the sequences complementary 
thereto, respectively. 

On the basis of the DNA sequences mentioned in the sequence 
protocol and in Figure 2, DNA fragments according to the 
invention may for example be obtained starting from plant DNA 
by means of restriction endonucleases using appropriate 
restriction sites or by employment of PCR by means of primers 
appropriately synthesized, or may, as an alternative, also be 
chemically synthesized. Such techniques are well-known to 
those skilled in the art. 

Moreover, the invention relates to any DNA sequences, which 
represent a gene or are a part of a gene encoding the enzyme 
N-acetyl glucosaminyl transferase I and, which in their 
entirety or in a partial region thereof hybridize under 
stringent conditions 

- to one or more of the DNA sequences of the invention and/or 

- to one or more of the DNA fragments of the invention and/or 

- to a DNA sequence, which is derived from the amino acid 
sequences mentioned in the sequence protocol considering the 
degeneration of the genetic code. 
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For this purpose, hybridization or screening probes are used 
as DNA fragments, which generally comprise at least 15 
nucleotides, typically between 15 and 30 nucleotides, and, if 
necessary, substantially more nucleotides. As an example, the 
primers employed in Example 1 may be used. Alternatively, DNA 
sequences of appropriate length, derived from the DNA sequen- 
ces mentioned in the sequence protocol, may be used. As a 
third possibility, appropriate hybridization probes according 
to the invention may be developed starting from the amino 
acid sequences mentioned in the sequence protccol considering 
the degeneration of the genetic code. 

In this respect, a subject-matter of the present invention 
are also genes encoding N-acetyl glucosaminyl transferase I, 
which may be detected from other varieties or plant species 
on account of the hybridization thereof to above mentioned 
hybridization probes, as well as DNA sequences, DNA fragments 
and constructs, which are derived therefrom in accordance 
with the above provisions. 

The isolation of the corresponding gene and sequencing thereof 
following detection by means of the hybridization probes of 
the invention are well within the skills of a specialist in 
this field, and are detailed by way of example with respect to 
N-acetyl glucosaminyl transferase I from Solanum tuberosum and 
to the corresponding enzymes from Nicotiana tabacum and 
Arabidopsis thaliana (partial sequence) in the examples. 

Finally, another subject matter of the present invention are 
antisense sequences with respect to any of the above DNA 
sequences . 

iv) Constructs 

Also comprised by the invention are constructs, which may 
optionally comprise besides additional 5' and/or 3' sequen- 
ces, e.g. linkers and/or regulatory DNA sequences or other 
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modifications, the DNA sequences of the invention, including 
the DNA sequences derived as detailed above. 



An example for this are hybridization or screening probes, 
which in addition to a DNA sequence of the invention also 
comprise a detection agent for the verification of hybridi- 
zation products, which in this case typically is non- 
radioactive, e.g. fluorescent or phosphorescent molecules, 
biotin, biotin derivatives, digoxigenin and digoxigenin deri- 
vatives. In this respect, however, radioactive or non- 
radioactive detection agents may be considered, which may be 
attached to the DNA sequence according to the present inven- 
tion e.g. by means of end labelling. 



A subject-matter of the invention are also antisense and 
sense constructs with respect to the DNA sequences and frag- 
ments according to the present invention, i.e. with respect 
to 

the DNA sequences mentioned in the sequence protocol and 
the corresponding genes; 

the DNA sequences derived therefrom in accordance with 
the above provisions; 

one or more regions of these DNA sequences; 
DNA sequences, especially from other varieties or plant 
species, which represent a gene or are a part of a gene, 
encoding the enzyme N-acetyl glucosaminyl transferase I; 
and which hybridize under stringent conditions 

to one or more of the above DNA sequences and/or 
to one or more of the above DNA fragments and/or 
to a DNA sequence, which is derived from the 
amino acid sequences mentioned in the sequence 
protocol considering the degeneration of the 
genetic code. 



Furthermore, the present invention extends to any 
fer systems such as vectors, plasmids, viral 
genomes or cosmids, which contain the DNA sequence 



s 



DNA-trans- 
and phage 
according 
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to the present invention, e.g. the GntI gene, cDNA and DNA 
regions according to the invention, as mentioned in the 
sequence protocol, fragments thereof, in particular antisense 
or sense constructs and/or cDNA sequences derived therefrom 
according to the above provisions. 

Various techniques for the production or synthesis of DNA, 
DNA fragments, constructs and transfer systems according to 
the invention, e.g. digestion by means of restriction endo- 
nucleases, PCR amplification using suitable primers, optio- 
nally followed by cloning and additional chemical or enzyma- 
tic modification starting from plant DNA are well-known to 
those skilled in the art. 

One possibility of application of the DNA hybridization 
probes according to the invention is the detection of N-ace- 
tyl glucosaminyl transferase I genes in plants other than 
those, from which the DNA sequences mentioned in the sequence 
protocol were obtained, or the detection of potential (other) 
isoforms of the N-acetyl glucosaminyl transferase I gene in 
the starting plants Solanum tuberosum, Nicotisna tabacum and 
Arabidopsis thaliana. 

If it is possible to make use of a plant genomic library or 
cDNA library for the hybridization experiment, a positive 
hybridization result of such screening of each library may 
indicate a clone or a few clones, which contain the desired 
sequence completely or in part, i.e. the N-acetyl glucosa- 
minyl transferase I gene, combined with only a limited amount 
of other DNA from the genome of the targe: plant, which 
appropriately facilitates cloning and sequencing of the tar- 
get gene. As an alternative, a PCR amplification of the gene 
or parts thereof may also be carried out starting from plant 
DNA and suitable constructs, so-called PCR primers, to faci- 
litate cloning and sequencing. 
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One use of sequencing primers of the invention, which are 
synthesized starting from suitable regions of the sequences 
according to the invention, e.g. enables genomic sequencing 
starting from the entire target plant genomic DNA cleaved by 
restriction endonucleases, by means of the Church-Gilbert 
technique, as well as sequencing at the cDNA level followxng 
RT-PCR amplification of the total RNA of the target plant 
(cf . Expl. 1) • 

A n alternative possibility of application of the DNA hybridi- 
zation probes according to the present invention derived from 
the DNA sequences mentioned in the sequence protocol, xs the 
use thereof according to the invention for the detection of 
plants with reduced or lacking N-acetyl gluccsaminyl trans- 
ferase I activity. The hybridization experiment serves to 
detect the N-acetyl glucosaminyl transferase I (GntI) gene by 
which it may be concluded, e.g. owing to a negative hybrxdx- 
zation result under stringent conditions, that the GntI gene, 
and thus, N-acetyl glucosaminyl transferase I activity xn a 
plant investigated is lacking. 

Such hybridization techniques for the detection of proteins 
or genes particularly in plant material by means of DNA 
probes are also known to the persons skilled in the art In 
this context, it is referred to the above statements under 
item iii) for possible hybridization conditions. Generally, 
suitable DNA hybridization probes comprise at least 15 
nucleotides of a sequence, which for example is derived from 
the cDNA sequences mentioned in Fig. 2 and the sequence pro- 
tocol or from the corresponding GntI genes. 

v) Transformed microorganisms 

Furthermore, the invention relates to microorganisms, such as 
bacteria, bacteriophages, viruses, unicellular eukaryotxc 
organisms, such as fungi, yeasts, protozoa, algae, and human, 
animal and plant cells, which have been transformed by one or 
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more of the DNA sequences of the invention or one or more of 
the constructs of the invention, as illustrated above. 

Transformed microorganisms according to the present invention 
are used e.g. as expression systems for the transforming 
foreign DNA to obtain the corresponding expression products. 
For this purpose, typical microorganisms are bacteria, e.g. 
such as E . coli. Furthermore, transformed microorganisms 
according to the invention, in particular agrobacteria, may 
be employed e.g. for the transformation of plants by trans- 
mission of the transforming foreign DNA. 

Methods for the transformation of cells of microorganisms by 
(foreign) DNA are well-known to those skilled in the art. 

For this purpose, e.g. constructs referred to as expression 
vectors are used, which contain the DNA sequence of the 
invention under control of a constitutive or inducible promo- 
ter, which, if necessary, is additionally tissue specific, so 
as to enable the expression of the introduced DNA in the tar- 
get or host cell. 



Therefore, a further aspect of the invention is a method for 
the production of the enzymes and proteins of the invention 
by using one or more of the transformed microorganisms of the 
present invention. The method comprises cultivating at least 
one microorganism transformed by the DNA of the invention, in 
particular by one of the cDNAs mentioned in the sequence pro- 
tocol, under the control of an active promoter, as defined 
above, and isolating the enzyme of the invention from the 
microorganisms, and, if applicable, also from the culture 
medium. It is understood, that this method also relates to 
the production of enzymes and proteins, respectively, which 
are derived from the enzymes according to the present inven- 
tion from Solanum tuberosum, Nicotiana tabacum and Arabidop- 
sis thaliana, as defined under i) above. 
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Methods for the cultivation of transformed microorganisms are 
well-known to those skilled in the art. For example, the iso- 
lation of the expressed enzyme may be employed according to 
the method described in Example 5 by means of metal-chelate 
chromatography or, alternatively, by chromatography via 
columns, which contain the antibodies against the enzyme 
bound to the packing material. 



vi) Transgenic plants 



Furthermore, the invention comprises transgenic plants, which 
are transformed by means of a DNA sequence according to the 
invention or a corresponding construct, respectively. Accor- 
dingly, there may be obtained e.g. transgenic plants, in 
which a GnTI deficiency, for example on account of a missing 
or defectice GntI gene or due to defects in the regulatory 
regions of this gene, has been removed by complementation 
using a construct derived from the cDNA sequences mentioned 
in the sequence protocol, wherein the expression of said con- 
struct is under the control of an active constitutive or 
inducible promoter, which may be additionally tissue speci- 
fic. In this case, the GnTI enzyme or protein expressed on 
account of the DNA of the invention contained in the con- 
struct and having GnTI activity complements the GnTI activity 
missing in the starting plant. 



Also considered are transgenic plants, in which the GnTI 
activity already present in the starting plant is increased 
by additional expression of the GntI transgene introduced by 
means of a construct according to the present invention. Up 
to now, the extremely low expression of the GntI gene in vivo 
accompanied by extremely low enzyme activity, which corre- 
spondingly was very difficult to detect, has been a main 
problem in the investigation of the enzyme N-acetyl glucosa- 
minyl transferase I in plants. The problem of a too low GnTI 
enzyme activity in plants may be overcome by the coexpression 
of a DNA according to the present invention. 
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In this case, it may be preferable for the transformation of 
plants to employ DNA according to the invention, additionally 
comprising a sequence region, which following expression 
enables a facilitated detection and/or enrichment and purifi- 
cation, respectively, of the protein produce having GnTI 
activity. This is for example accomplished by the use of a 
specific DNA sequence for the expression of a recombinant 
GnTI enzyme, said sequence carrying a N-terminal or C-termi- 
nal sequence extension encoding an affinity marker. If it is 
additionally intended to provide an amino acid sequence por- 
tion between the GnTI enzyme and the affinity marker, which 
represents a recognition site for a specific protease, clea- 
vage of the N-terminal or C-terminal sequence extension from 
the GnTI enzyme may be achieved by the subsequent use of this 
specific protease, and the GnTI enzyme thereby obtained in 
isolated form. 

An example for this is the use of a DNA sequence according to 
the present invention, which codes for the recombinant GnTI 
enzyme with a C-terminal sequence extension, encoding the 
affinity marker AWRHPQFGG (strep-tag; ref. 39), and an inter- 
vening protease recognition site IEGR. The expression of the 
DNA according to the present invention provides GnTI enzymes 
with the C-terminal sequence extension mentioned, by means of 
which the expressed protein molecules specifically bind to a 
streptavidin derivatized matrix, and may thus be isolated. 
Then, by means of the protease factor Xa specifically 
recognizing the amino acid sequence IEGR, the GnTI portion of 
the protein molecules may be released. As an alternative, the 
complete protein may be removed from the streptavidin deriva- 
tized matrix by means of biotin or biotin derivatives. 

A further example is represented by DNA sequences of the inven- 
tion, encoding a protein which comprises multiple, e.g. 10, N- 
terminally added histidine residues (His-tag) in addition to a 
GnTI enzyme. Due to the N-terminal histidine residues, isola- 
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tion or purification, respectively, of the proteins expressed 
may be easily conducted by metal-chelate affinity chromato- 
graphy (e.g. Ni sepharose) (cf . also Example 5) . 

Moreover, the invention comprises portions of such transgenic 
plants, adequately transformed plant cells, transgenic seeds 
and transgenic reproduction material. 

A further important aspect of the invention is the use of the 
sequence information discussed above for the production of 
plants having reduced or lacking N-acetyl glucosaminyl trans- 
ferase I activity. 



The possibilities of identifying plants with reduced or lack- 
ing N-acetyl glucosaminyl transferase I activity due to a 
gene defect or a missing gene by means of antibodies of the 
invention or screening or hybridization probes of the inven- 
tion have already been described above. 



Two additional possibilities reside in the use according to 
the invention of antisense or sense constructs, respectively, 
which are derived from the DNA sequence of a plant GntI gene, 
for the production of transgenic plants with reduced or lack- 
ing N-acetyl glucosaminyl transferase I activity by means of 
homology-dependent gene silencing (cf. ref. 16,17). The DNA 
sequence used as a starting sequence for the generation of 
the constructs, may be derived from the starting plant to be 
transformed itself but also from a different plant variety or 
species. In particular, antisense or sense constructs as 
discussed under items iii) and iv) above are of use. 
Generally, the constructs employed comprise at least 50 to 
200 and more base pairs. 



In particular, the constructs employed for this purpose 
comprise at least 50 to 200 and more base pairs, with a 
sequence, which is derived on the basis of 



- 20 - 



- the cDNA sequences mentioned in the sequence protocol 
and/or the corresponding GntI genes and/or 

- the derived DNA sequences discussed above and/or DNA frag- 
ments according to the present invention and/or 

- the DNA sequences, in particular from other varieties and 
plant species, which encode N-acetyl glucosaminyl transferase 
I and which may be identified due to a hybridization under 
stringent conditions to hybridization or screening probes, as 
defined under items iii) and iv) above. 

Generally, the constructs contain a strong constitutive or 
inducible promoter, which additionally may be tissue speci- 
fic, by means of which the antisense or sense DNA sequence 
regions are controlled. 

In the production of transgenic plants by integration of 
antisense construct (s) into the plant genome or by viral 
infection of starting plants or plant cells by means of virus 
containing antisense construct (s) for an extrachomosomal pro- 
pagation and transcription of the antisense construct or the 
antisense constructs in infected plant tissue, it is intended 
to achieve a hybridization of Gntl-gene transcripts to 
transcripts of the antisense DNA region at the RNA level, 
which prevents translation of the GntI mRNA. The result is a 
transgenic plant with strongly decreased contents of N-acetyl 
glucosaminyl transferase I, and thus, a strongly decreased 
corresponding enzyme activity. 

For the transformation of plants according to the invention 
with antisense constructs, for example constructs may be 
employed, which hybridize to one of the complete cDNAs, 
mentioned in Fig. 2 and in the sequence protocol, or to 
corresponding regions thereof, generally comprising at least 
50 to more than 200 base pairs. Moreover, particularly pre- 
ferred is the use of fragments, the transcripts of which 
additionally cause a hybridization to a portion of the 5' un- 
translated region of the GntI mRNA, at which or in the proxi- 
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mity of which usually the attachment of ribosomes would 
occur. Examples of such constructs are shown in Fig. 4. 

In view of the occurence of an isoform in Solanum tuberosum, 
which probably is located in the cytoplasm due to lack of the 
membrane anchor (aa 10 to 29) of yet unkown function, it may 
be desirable to target only the N-acetyl glucosaminyl trans- 
ferase I enzyme located in the Golgi cisternae, i.e. only 
that enzyme comprising the membrane anchor. One reason for 
this desire may be the effort or, in the individual case, 
also the requirement, to affect as little as possible the 
cytoplasmatic metabolism of the plant cell, for which the 
cytoplasmatic N-acetyl glucosaminyl transferase I possibly is 
of importance. For this purpose, antisense constructs may be 
used according to the present invention, which themselves or 
the transcripts of which, respectively, hybridize to a DNA or 
RNA region of the GntI gene or the GntI mRNA, comprising a 
part of the 5' untranslated region and the coding region 
including the membrane anchor. Generally, the extension of 
the region of hybridization up to position 266 of the cDNA in 
Fig. 2 and SEQ ID NO: 1 is considered harmless for the 
purpose mentioned above. 

In the production of transgenic plants by integration of sense 
constructs into the plant genome or by viral infection of star- 
ting plants or plant cells by means of virus containing sense 
construct (s) for extrachromosomal propagation and expression of 
the construct or constructs in infected plant tissue, there are 
assumed hybridization phenomena in tobacco according to the 
work of Faske et al. (ref . 17), of said constructs to the endo- 
genous GntI gene at a posttranscriptional or DNA level, re- 
spectively, which finally affect or prevent the translation of 
the GntI gene. Also in this case, the result are transgenic 
plants having reduced or even lacking N-acetyl glucosaminyl 
transferase I activity. 
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Methods for the stable integration of such antisense and 
sense constructs into the genome of plants, or for the viral 
infection of plants or plant cells, respectively, for an 
extrachromosomal propagation and transcription/expression of 
such constructs in infected plant tissue are well known to 
those skilled in the art. This includes the direct DNA trans- 
fer (e.g. into protoplasts by means of electroporation or by 
the addition of a high molecular osmotic agent as well as 
biolistic methods, by which DNA coated particles are shot 
into the plant tissue) , such as the use of natural 
host/vector systems (e.g. agrobacteria or plant viruses). For 
viral infection of starting plants or plant cells by viruses 
containing appropriate constructs for extrachromosomal propa- 
gation and transcription/expression of the constructs in 
infected plant tissue, a variety of specific viruses, such as 
tobacco mosaic virus (TMV) or potato virus X, is available. 

Representative plants, which are suitable for such integra- 
tion, comprise dicotyledonous as well as monocotyledonous 
cultivated plants, in particular Solanaceae such as potato, 
tobacco, tomato and pepper. Additionally, banana, alfalfa, 
canola, beets, soybean, lettuce, corn, rice and grain, would 
be suitable target plants for the use of homologous antisense 
constructs. For example, the sequence from Arabidopsis tha- 
liana mentioned in the sequence protocol appears to be 
particularly suitable as a starting sequence for the trans- 
formation according to the invention of Brassicaceae, such as 
canola plants, by means of sense or antisense constructs. 
Further plants of interest are any plants, which express gly- 
coproteins of interest for medicine and research. 

Generally, it should be noted, that the transformation accor- 
ding to the invention of plants, which in the corresponding 
region of the GntI gene exhibit a homology of > 70% at the 
nucleotide level to the employed antisense or sense constructs 
according to the present invention, typically results in 



- 23 - 



■ r Dlants 0 f the invention, which show the desired 
transgenic plants or me , rtivitv 
reduction of N-acetyl glucosaminyl transferase I activity. 

Further, another possibility is seen in the targeted de- 
duction , k noc k -out, of the N-acetyl glucosaminyl ran - 

"erase I gene via gene targeting by means of homologous 
, f ?zn in a target plant using a suitable 
recombination (ref. 24) m a rarg v DreS ent 
DNA fragment derived from the cDNA sequence of the present 
invention, similar to the procedure established for yeast 
systems and mammals. 

F urther, the present invention comprises transgenic plants, 
which have been transformed by the antrsense or sense con 
structs mentioned above or the viruses conta.nrng the same 
, as oarts of such transgenic plants, 

^^^^^ ceiis, trans^c seeds and 
transgenic reproduction material. 

Methods of the production of transgenic plants, * ~™ 

harfpria . or virus-mediated as well as direct una 
: anX a C I oJ to those billed in the art. Concerning 
representative plants for such a transformation, the above 

mentioned applies. 

The plants of the invention and the plants obtained according 
The plan reS pectively , with reduced or lacking N- 

to the invention, respectively, 

acetyl glucosaminyl transferase I activity, may be used 
according to the invention for the production of glyco- 
; ns with minimal and uniform, i.e. defined, sugar 

^ ,K^r^ Quch Glycoproteins are of great 
H 11PS As discussed above, sucn g^y^ 

Stance for medicine and research. As a reasonable source 
Traw material and food as well as due to their unprob ema- 
tical disposal via composting, plants per se ^ 
oioreactors. According to the present invention, it is now 
Hie to express biotechnologically or pharmaceutical^ 
possible to express . hpraoe utics of low antigenic 

relevant glycoproteins (e.g. therapeutics 
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potential for mammals) in cultivated plants, in which GnTI 
activity is highly reduced or completely absent. 

Accordingly, the invention also comprises a method for the pro- 
duction of glycoproteins with minimal uniform and defined sugar 
residues, comprising cultivating a transgenic plant according to 
the invention, of parts of such plants or of plant cells 
transformed according to the invention, each expressing the 
desired glycoprotein, as well as isolating the desired glyco- 
protein from the cultivated material. 

In this context, representative cultivated plants are 
Solanaceae, in particular potato, tobacco, tomato and pepper. 
Furthermore possible are banana, alfalfa, canola, beets, soy- 
bean, lettuce, corn, rice and grain. 

The sequence of the enzymatically controlled and plant speci- 
fic N-glycan modifications, which secretory glycoproteins are 
subjected to during passage through the Golgi apparatus of 
higher plants, is schematically shown in Fig. 1. The biosyn- 
thesis block due to lacking or insufficient N-acetyl glucosa- 
minyl transferase I (GlcNAc transferase I) activity in a 
plant leads, instead of complex glycans, to the predominant 
formation of glycans of the Man 5 GlcNAc 2 type, i.e. glyco- 
proteins with uniform and well-defined sugar residues, which 
are of extremely high importance for medicine and research. 

For this purpose, the genes encoding the desired glyco- 
proteins may be expressed in their natural producing plants, 
which have been transformed according to the present inven- 
tion e.g. by means of antisense or sense constructs to yield 
transgenic plants with reduced or missing N-acetyl glucosa- 
minyl transferase I activity. 
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There is also the possibility to use transgenic plants of the 
invention displaying reduced or lacking N-acetyi glucosaminyl 
transferase I activity, which additionally have been trans- 
formed by the gene encoding the desired glycoprotein. In 
order to achieve this, constructs may be employed, which con- 
tain the gene encoding the desired glycoprotein under the 
control of a strong constitutive or inducible promoter, which 
is optionally tissue specific as well, and lead to the inte- 
gration of the gene into the plant genome. Alternatively, the 
transformation may also be conducted by viral infection by 
means of a virus containing the gene for the desired glyco- 
protein for extrachromosomal propagation and expression of 
the gene. The glycoprotein may then be expressed in the 
respective host plant and obtained therefrom. 

Naturally, as an alternative, the procedure may be such, that 
initially a transformation using an expression construct or 
virus containing the DNA encoding the glycoprotein is per- 
formed, and subsequently, another transformation with one or 
more of the antisense or sense constructs of the invention or 
with one or more viruses, containing the corresponding DNA, 
is performed. It is also possible to perform a simultaneous 
transformation using both constructs or using one virus con- 
taining the antisense or sense construct as well as the gene 
encoding the desired glycoprotein (piggyback version) . 

Within the scope of the present invention, there is also con- 
sidered a viral overinf ection of the transgenic plants accor- 
ding to the invention, in which integration of an anti- 
sense/sense construct and/or the gene encoding the desired 
glycoprotein into the genome has already occured, by viruses 
containing the antisense/sense construct and/or the gene 
encoding the desired glycoprotein, for an additional 
extrachromosomal propagation and transcription or expression, 
respectively, of this DNA. As a result, the concentrations of 
antisense or sense DNA, respectively, or of the expressed 
glycoprotein may be increased in the transgenic plant cells. 
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It may prove to be practical for the production according to 
the invention of glycoproteins with defined glycosylation, to 
use tissue specific promoters in such cases, where it is 
intended to obtain the desired glycoproteins specifically 
only from certain parts of a plant such as tubers or roots. 
Today, for a large variety of plant tissues, tissue specific 
promoters are available, which drive expression of foreign 
genes specifically only in these tissues. By way of example, 
tuber specific promoters such as patatin class I (ref. 26) 
and proteinase inhibitor II promoters (ref. 27) may be men- 
tioned. Under certain conditions, both promoters exhibit 
expression also in leaf tissue, i.e. they can be induced by 
high metabolite contents (for example sucrose) and in the 
case of the proteinase inhibitor II promoter also by mechani- 
cal lesion or by spraying with abscisic or jasmonic acid, 
respectively. 



The use of tissue specific promoters may also be indicated in 
cases, where the DNA sequence or the transcription products 
or translation products thereof according to the invention, 
respectively, which are employed for the transformation, turn 
out to be detrimental to certain plant parts,, e.g. due to a 
negative influence on the metabolism of the corresponding 
plant cells. 



As a representative target glycoprotein, human glucocerebro- 
sidase may be used for the therapy of the hereditary 
Gaucher's disease (ref. 25). In order to obtain human 
glucocerebrosidase (GC) with uniform and defined sugar resi- 
dues, e.g. plants of the present invention which are trans- 
formed by means of antisense DNA, may be transformed with the 
gene encoding human glucocerebrosidase. For this purpose, the 
human glucocerebrosidase cDNA sequence (ref. 38) is modified 
at the 3' terminus by means of PCR using gene specific 
primers in a manner, that the recombinant enzyme carries a C- 
terminal sequence extension encoding an affinity marker (e.g. 
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AWRHPQFGG, strep-tag; ref . 39) and, optionally, also a pro- 
tease recognition site (e.g. IEGR) between the GnTI enzyme 
region and the affinity marker. The GC-cDNA sequence thus 
altered is expressed in GntI antisense plants of the present 
invention by using a strong and optionally cissue specific 
promoter (e.g. for potato under the control of the tuber 
specific B33 patatin promoter) , so that the enzyme synthe- 
sized in these plants exclusively carries well defined N-gly- 
cans. The affinity marker is intended to facilitate the en- 
richment of the recombinant enzyme from the transgenic 
plants. In this case, the expressed protein molecules (GC- 
strep molecules) bind to a streptavidin derivatized matrix 
via the affinity marker sequence and can be released there- 
from by means of biotin or biotin derivatives. The removal 
from the strepatavidin derivatized matrix may also be carried 
out by means of catalytic amounts of a protease, which exhi- 
bits a specificity for the protease recognition site located 
between the GnTI enzyme region and the affinity marker. In 
this case, only the GnTI enzyme region is released from the 
matrix. This could be advantageous especially in that case, 
if the affinity marker sequence has a detrimental effect on 
the GnTI activity. 

Due to their terminal mannose residues, the Man 5 GlcNAc 2 -gly- 
cans of the glucocerebrosidase obtained from the plants of 
the present invention will be recognized by macrophages as an 
uptake signal, and can thus directly be employed for the 
therapy of hereditary Gaucher 1 s disease. Currently, a therapy 
is only possible upon expensive isolation and deglycosylation 
of native glucocerebrosidase (ref. 25) . 

Accordingly, the production of recombinant glycoproteins may 
be highly facilitated by the use of plant GntI sequences com- 
pared to conventional methods, e.g. the chemical deglycosyla- 
tion of purified glycoproteins, which is technically deman- 
ding (ref. 25), or a difficult and expensive production in 
GnTI deficient animal cell lines (ref. 7,10). 



- 28 - 



Description of the figures: 

Fig. 1: Sequence of plant specific N-glycan modifications, 
which secretory glycoproteins are subject to during 
passage through the Golgi apparatus of higher plants 

(ref. 28). The biosythesis block to complex modified 
glycans is based on a deficiency in GnTI activity 

(which is either caused by a defective or missing 
GnTI enzyme or by effective reduction of the GntI 
gene expression) and is indicated by a cross. Meaning 
of the symbols: (F) fucose residues, (X) xylose resi- 
dues, (•) GlcNAc residues, (□) mannoss residues. 

Fig. 2: Full length cDNA sequence of a plant GnTI from potato 
(Solanum tuberosum L.) and amino acid sequence deduced 
therefrom. By way of example, the complete cDNA of the 
membrane anchor containing GntI isoform from potato 
leaf tissue (Al) is illustrated. The EcoRI/NotI 
linkers at the 5' and 3' ends of the cDNA are 
highlighted by bold letters, the binding sites of the 
degenerate oligonucleotides used for obtaining the RT- 
PCR probe are underlined. In contrast to already 
published animal GnTI sequences, the protein sequence 
derived from the potato cDNA clones contains a 
potential N-glycosylation site: Asn-X (without Pro)- 
Ser/Thr, which is indicated by an asterisk. The region 
of the membrane anchor is highlighted in italics (aa 
10 to 29). The start of the isoform (A8), which is 
potentially located in the cytosol, is indicated by an 



Fig. 3: A, Degree of identity or similarity, respectively, of 
the amino acid sequence deduced from a complete GntI 
cDNA sequence from potato (Al) in comparison to other 
GnTI sequences of animal organisms, which have been 
selected from data bases. Identical amino acid posi- 
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tions (in %) are printed in bold letters, similar 
amino acid positions are given in brackets under- 
neath. Meaning of the abbreviations: Hu, human; Ra, 
rat; Mo, mouse; Ce, Caenorhabditis elegans (round- 
worm) ; St, Solanum tuberosum (potato) . 

B, Comparison of the derived amino acid sequences of 
different plant Gntl-cDNA clones. A_Stb-Al, GnTI from 
potato leaf; B_Ntb-A9, GnTI from tobacco leaf (A9) ; 
C Atb-Full, GnTI from Arabidopsis thaliana. Identical 
aa are highlighted in black, similar aa in light 
grey. 

Fig. 4: Cloning strategy of the G/itl-antisense constructs 
used. Following fill-in of the ends, a NotI linker 
was introduced into the Sail restriction site of the 
polylinker region of the plant expression vector pA35 
(=pA35N) (ref. 29), and the complete Al-Gntl-cDNA was 
inserted into pA35N via NotI . The corresponding anti- 
sense construct (=pA35N-Alas) was inserted into 
binary vector pBinl9 (ref. 30) via EcoF.I and Hindlll. 
Additionally, following PCR amplification, a 5' frag- 
ment of the Al-GntJ-cDNA comprising 270 bp was cloned 
into pA35N via Xbal and NotI restriction sites in 
antisense orientation (=pA35N-Al-short ) and also 
inserted into pBinl9. Abbreviations; Numerals in 
brackets, positions of the restriction sites in the 
Al-Gnt J-cDNA (in base pairs); pBSK, cloning vector 
(Stratagene) : pGEM3Z, cloning vector (Promega) ; 
CaMVp35S, constitutive 35S promoter of cauliflower 
mosaic virus; OCSpA, polyadenylation signal of octo- 
pin synthase; pNOS, promoter of nopaline synthase; 
NEO, neomycin phosphotransferase (selection marker, 
confers kanamycin resistance) ; NOSpA, polyadenylation 
signal of nopaline synthase; LB/RB, left/right border 
of the T-DNA of the binary vector; arrow, translation 
initiation (ATG) ; A8 , start of the GnTI isoform, 
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which is potentially located in the cytosol (7 aa 
substitutions in comparison to Al) . 

Fig. 5: Extent of suppression of complex glycoprotein 
modification in transgenic potato plants transformed 
with the long GntI antisense construct (cf. Fig. 4). 
A, Coomassie-stained SDS gel from leaf extracts; B, 
Western-blot analysis (Ref. 13,33) of parallel 
samples developed with a complex-glycan antiserum 
(Ref. 12,13). The lanes contain 30 \iq each of total 
protein: cgrl(Ara), Arabidopsis cgl mutant (Ref. 13); 
WT(Desi), wild-type potato; the numerals refer to 
individual transgenic potato plants; the arrows 
represent molecular weight standards of 66, 45, 36 
and 29 kDa, respectively. 

Fig. 6: Detection of specificity of the generated GnTI 
antiserum following cell fractionation (Ref. 40) of 
tobacco callus material. For Western-blot analysis 
(Ref. 13,33) 30 ug of protein were applied per lane. 
The antiserum was used in 1:1000 dilution. Lane 1, 
homogenate following separation of cellular debris; 
lane 2, vesicle fraction following column chromato- 
graphy; lane 3, sucrose gradient fraction I 
(microsomes); lane 4, sucrose gradient fraction II 
(plastids) ; lane 5, antigen used for immunization 
(recombinant GnTI fusion protein) ; arrow, molecular 
weight of about 4 9 kDa. 



Explanation of the abbreviations used in the text: 

Aa, amino acid(s); bp, base pair(s); EMS , Ethyl methane sul- 
fonate (mutagenic agent); F2, second filial generation; Fuc, 
fucose; Glc, glucose; GlcNAc, N-acetyl glucosamine; GnTI, N- 
acetyl glucosaminyl transferase I (EC 2.4.1.101); GntI, gene 
for GnTI (nuclear encoded) ; kDa, kilodalton; Man, mannose; 
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PCR, polymerase chain reaction; PAGE, polyacrylamide gel 
electrophoresis; ref., reference; RT-PCR, reverse transcrip- 
tion coupled polymerase chain reaction; SDS, sodium dodecyl 
sulfate; var., variety; Xyl, xylose. 

In the following, the invention will be described in more detail 
by means of examples, which are only intended to illustrate the 
invention and shall not limit the invention in any manner. 



Example 1 

Isolation and characterization of plant GntI cDNA clones. 

Total RNA was isolated from potato and tobacco leaf tissue, 
and cDNA fragments of about 90 bp were amplified by means of 
RT-PCR in combination with degenerate primers (procedure ana- 
logous to ref. 31), which were derived from conserved amino 
acid regions of known GnTI sequences frome animal organisms 
(sense primer 1*, 5'-TG(CT) G(CT)I (AT) (GC) I GCI TGG 
(AC)A(CT) GA(CT) AA(CT)-3'; antisense primer 3*, 5' -CCA ICC 
IT (AG) ICC (ACGT)G(CG) (AG) AA (AG) AA (AG)TC-3' ; 30 pmol of 
each primer per 50 ul PCR assay at an annealing temperature 
of 55°C and 45 cycles). Following gel eluticn, the ends of 
the PCR products were repaired (i.e. blunt ended using DNA 
polymerase I and phosphorylated using T4 polynucleotide 
kinase) and cloned into the EcoRV restriction site of pBSK 
(Stratagene) . By comparison with known GnTI sequences between 
the primers (arrows), the identity of the derived amino acid 
sequences from the potato and tobacco RT-PCR products could 
be confirmed as being homologous; =>Q (R/M) QFVQDP ( D/Y) ALYRSc= 
(homologous aa are underlined) . Of one clone each, radio- 
labeled probes were synthesized by means of PCR (standard 
PCR assay using degenerate primers as above, nucleotide mix- 
ture without dCTP, but instead with 50 uCi cc- 32 P-dCTP [>3000 
Ci/mMol]), and different cDNA libraries were screened for 
GntI containing clones using the corresponding homologous 
potato or tobacco probes, respectively (procedure analogous 
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to ref. 31; the stringent hybridization conditions have 
already been described in the text above) . The cDNA libraries 
were prepared from mRNA of young and still growing plant 
parts (sink tissues). Following cDNA synthesis and ligating 
EcoRI/NotI adaptors (cDNA synthesis kit, Pharmacia) EcoRI 
compatible lambda arms were ligated, those packaged and used 
to transfect E. coli XL1 Blue cells (Lambda ZAPII cloning and 
packaging system, Stratagene) . Following amplification of the 
libraries, one full-length GntI clone each was isolated from 
a potato leaf sink library (Al according to Fig. 2 and SEQ ID 
NO: 1) and a tobacco leaf sink library (A9 according to SEQ 
ID NO: 3), as well as two additional clones from a tuber sink 
library (A6, A8) . The deduced GnTI amino-acid sequences con- 
tain a potential N-glycosylation site, Asn-X (without Pro) - 
Ser/Thr, in contrast to those of animals. One of the tuber 
GntI cDNA sequences carries stop codons in all three reading 
frames in front of the first methionine (A8). The coding 
region shows high homology to the longer tuber clone (A6) 
(only 2 aa substitutions), but displays a completely diffe- 
rent 5' non-translated region. Furthermore, the membrane 
anchor characteristic for the Golgi enzyme is missing, so 
that this GnTI isoform might be located in the cytosol. 
Sequence comparisons carried out by means of the gap or 
pileup option, respectively, and the box option of the gcg 
software package (J. Devereux, P. Haeberli, 0. Smithies 
(1984) Nucl. Acids Res. 12: 387-395) indicate, that the de- 
duced plant GnTI amino-acid sequences exhibit only 30-40% 
identity and 57-59% similarity to those of animal organisms 
(Fig. 3A), while they are highly homologous among each other 
(75 - 90% identity, Fig. 3B) . 

The procedure in the case of Arabidopsis thaliana was analo- 
gous, wherein for the preparation of a specific probe first a 
partial GntI sequence was amplified by RT-PCR using GntI 
sense primer 4A (5' -ATCGGAAAGCTTGGATCC CCA GTG GC (AG) GCT GTA 
GTT GTT ATG GCT TGC-3'; HindHI restriction site underlined, 
BamHI printed in bold) and antisense primer 3*, as defined 
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above. First, a 5' -incomplete cDNA clone was isolated from a 
phage library (Lambda Uni-Zap) using this probe. By means of 
a vector insert PCR, the missing 5' -terminus was amplified 
from another library (via an unique Spel restriction site in 
the 5' region) and assembled to yield a full-length cDNA 
sequence. The nucleic acid sequence determined by means of 
sequencing is listed in SEQ ID NO: 5. 

Example 2 

Functional complementation of a GnTI defect using 
GntI cDNA upon transient expression in protoplasts of the 
Arabidopsis thaliana cgl mutant. 

Approximately 4 weeks subsequent to sowing, protoplasts were 
isolated from leaves of cgl mutants cultivated under sterile 
conditions (nonstainer plants following 5 backcrosses, ref. 
13), transformed with expression constructs of the complete 
GntI cDNA sequences (NotI cDNA fragments, cf. Fig. 4) in 
sense (pA35N-Als or pA35N-A9s, respectively) or antisense 
orientation (pA35N-Alas or pA35N-A9as, respectively) , and 
cultivated for 96 h at room temperature in the dark (50 ]xg of 
plasmid DNA each per 1 million protoplasts, PEG method accor- 
ding to ref. 32) . Subsequent SDS-PAGE of the protoplast 
extracts and Western-blot analysis (analogous to ref. 13, 33) 
indicated functional complementation of the GnTI defect, i.e. 
complex glycosylation of numerous protein bands upon transi- 
ent expression of the potato Al and tobacco A9 sense con- 
structs, but not of the corresponding antisense constructs in 
protoplasts of the Arabidopsis cgl mutant (data not shown) . 
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Example 3 

Cloning of the binary expression constructs 
pBin-35-Alas and pBin-35-Al-short (cf . Fig. 4) . 

Into the Sail restriction site of the polylinker region 
(corresponding to the one of pUC18) of plant expression 
vector pA35 (ref. 29), a NotI linker was introduced subse- 
quently to the fill-in of the ends (=pA35N) , and the complete 
Al-Gn t X-cDNA (nucleotides 9 to 1657, according to the cDNA in 
Fig. 2) was inserted into pA35N via NotI (sense construct 
pA35N-Als and antisense construct pA35N-Alas, respectively) . 
The expression cassettes of the sense and antisense 
constructs, respectively, were isolated via the terminal 
restriction sites (filled-in Ncol restriction site, partial 
post digestion with Hindlll) as a fragment of about 2410 bp 
and inserted into the EcoRI (filled-in) and Hindlll 
restriction sites of the binary vector pBinl9 (Ref. 30) 
(=pBin-35-Als and pBin-35-Alas , respectively) . The EcoRI 
restriction site of the vector is restored by fusion with the 
equally filled-in Ncol restriction site of the fragment. By 
means of a standard PCR assay (sense primer: KS sequencing 
primer (Stratagene) extended for PCR, 5'-GGC CCC CCC TCG AGG 
TCG ACG GTA TCG-3'; antisense primer: 5 ' -GGGCC TCTAGA CfCGAG 
AGC (CT)AC TAC TCT TCC TTG CTG CTG GCT AAT CTT G-3' , Xbal 
restriction site underlined, Xhol restriction site in ita- 
lics) , there was additionally amplified a 5' -fragment of the 
GntI cDNA at an annealing temperature of 50 °C (nucleotides 9 
to 261, according to the cDNA in Fig. 2 and SEQ ID NO: 1) . 
The PCR product was digested with Xbal (within the antisense 
primer) and NotI (within the 5' -linker of the cDNA) , isolated 
as a fragment of about 260 bp and cloned into pA35N (=pA35N- 
Al-short) . The expression cassette of the short antisense 
construct was also inserted into pBinl9 (=pBin-35-Al-short ) 
as a EcoRI/Hindlll fragment (about 1020 bp) . 
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Example 4 

Transformation of agrobacteria by means of the binary GntI 
constructs and regeneration of transgenic potato and tobacco 
plants, respectively, from infected leaf discs. 

The binary antisense GntI constructs (pBin-35-Alas and pBin- 
35-Al-short) were transformed into the Agrobacterium strain 
GV2260 (ref. 34, 35). By way of example, sterile leaf discs 
of potato plants var. Desiree and of tobacco plants var. 
Wisconsin 38 were infected with the recombinant agrobacterial 
lines (50 ul of a fresh overnight culture in 10 ml liquid 2MS 
medium: 2% sucrose in Murashige & Skoog salt /vitamin standard 
medium, pH 5.6; small pieces of leaf without mi drip; co-cul- 
tivation for 2 days in the dark in phytotrons) . Subsequent to 
washing of the infected leaf pieces in 2MS medium with 250 
ug/ml claforan, transgenic plants were regenerated from said 
pieces in tissue culture under kanamycin selection (potato 
protocol ref. 26; tobacco protocol ref. 36) and analyzed for 
reduced GnTI activity (exemplary shown in Fig, 5 for trans- 
genic potato plants). As apparent from Fig. 5, antisense 
suppression of complex glycoprotein modifiaction was 
successful in transgenic potato plant #439. The determined 
reduction of complex glycoprotein modification was stable in 
this transformant over the entire investigation period of 
several months and has been verified in three tests which 
were performed in an interval of about 1 month each. For the 
respective transgenic tobacco plants, analogous results were 
obtained. 



Example 5 

Production of recombinant potato GnTI protein 
(for the production of antibodies) . 

Recombinant GnTI carrying 10 additional N-terminal histidine 
residues (His-tag) was produced in E. coli by means of the 
pET system (Novagen) and purified by metal-che late affinity 
chromatography. A cDNA fragment comprising nucleotides 275- 
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1395 of the potato GntI cDNA (corresp. to aa 75-446, Fig. 2 
and SEQ ID NO: 1 and 2, respectively) wa.s amplified by 
standard PCR (annealing temperature of 50°C, 30 cycles, ref. 
31) (sense primer GntJ-5'fus: 5'-CATGGATCC CTC GAG AAG CGT 
CAG GAC CAG GAG TGC CGG C-3'; antisense primer Gntl-3'stop: 
5 1 -ATCCCGGGATCCG CTA CGT ATC TTC AAC TCC AAG TTG-3'; Xhol and 
BamHI restriction sites, respectively, are underlined, stop 
codon in italics), and inserted into vector pET16b (Novagen) 
(=pET-His-Al) via the restriction sites of the synthetic 
primer ( 5' -Xhol-Gnt J-BamHI-3 ' ) . Following propagation and 
analysis in E . coli XLl-Blue (Stratagene) the construct was 
stored as a glycerol culture. Competent E. coli BL21(DE3) 
pLysS cells (Novagen) were transformed with pET-His-Al for 
overexpression. Addtition of IPTG ( Isopropyl-l-thio-p-D- 
galactopyranoside, at 0.5-2 mM) to a BL21 culture in loga- 
rithmic growth phase, initially induces the expression of T7 
RNA polymerase (from the bacterial chromosome), and thus, 
also the expression of the recombinant fusion protein under 
control of the T7 promoter in pET vectors (Novagen) . By means 
of metal-chelate chromatography using TALON matrix 
(Clontech) , recombinant potato GnTI was purified from induced 
BL21:pET-His-Al cells under denaturating conditions via its 
His-tag (manufacturer's protocol, Novagen), and the prepara- 
tion was verified with respect to homogeneity by means of 
SDS-PAGE. 

Example 6 

Raising of polyclonal antibodies in rabbits. 

Recombinant potato GnTI (from Expl . 5) was used as an antigen. 
Following the harvest of some milliliters of pre-immune serum, 
the rabbits were subcutaneously injected with 300-500 pg of 
affinity-purified protein together with 25 \ig of GMDP adjuvant 
(Gerbu) in intervals of three weeks. Subsequent to three basis 
injections, the animals were bled from the ear vein 12 to 14 
days after the respective successive injection (boost) , the 
serum harvested (ref. 37) and tested for recognition of 
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recombinant GnTI by Western-blot analyses (dilution 1:200 to 
1:2000). The antiserum of the boosts resulting in the lowest 
background-to-signal ratio were mixed with 0.04% sodium azide, 
aliquoted and kept at +4°C or for long-term storage at -20°C, 
respectively. As shown in Fig. 6, Western-blot analyses of 
tobacco callus cells (BY-2 suspension culture) revealed a 
specific GnTI signal in enriched microsomal fractions, which 
indicates, that antibodies raised against the recombinant 
protein specif ially recognize plant GnTI. The detection was 
carried out with enriched microsomal fractions (ER and Golgi 
vesicles) , since - due to low amounts - it is not possible to 
detect GnTI protein in crude plant extracts by means of the 
employed Western-blot method. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: von Schaewen, Antje Dr. rer. nat . 

(B) STREET : Natruper s t rasse 169a 

(C) CITY: Osnabrueck 

(E) COUNTRY : Germany 

(F) POSTAL CODE (ZIP) : D-49076 

(G) TELEPHONE: +4 9-5 41-68 4 02 9 

(ii) TITLE OF INVENTION: Plant gntl sequences and the use thereof 
the production of plants having reduced or lacking N-acetyl 
glucosaminyl transferase I (GnTI) activity 

(iii) NUMBER OF SEQUENCES: 6 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version 1*1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosum 

(B) STRAIN: Desiree 

(D) DEVELOPMENTAL STAGE: Sink organ 

(F) TISSUE TYPE: Mesophyll 

(G) CELL TYPE: Leaf cells 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Lambda ZAP II (Eco RI ) 

(B) CLONE: gntl-Al (K) 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 659. . 667 

(D) OTHER INFORMATION: /function^ "Asn codon :.n this 
context is a potential glycosylat ion site" 
/product^ "N-glycosylation consensus sequence" 
/phenotype= "N-glycans modulate protein 
properties " 

/standard_name= "N-glycosylation site" 
/label= pot-CHO 

/note= "GnTI-coding sequences from annuals do not 
contain this feature" 
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<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 53. . 1393 

(C) IDENTIFICATION METHOD: experimental 
{ D) OTHER INFORMATION : /codon_start= 53 

/function- "initiates complex N-glycans on 
secretory glycoproteins" 
/EC_number= 2.4.1.101 
/product- 

"beta-1, 2-N-acetylglucosaminyltransf erase I" 
/evidence^ EXPERIMENTAL 
/gene= "cgl" 
/standard_name= "gnt I " 
/label- ORF 

/note- "first gntl sequence from potato (unpublished) 

(ix) FEATURE: 

(A) NAME /KEY : 5 ' UTR 

(B) LOCATION: 15 . . 52 



(ix) FEATURE: 

(A) NAME /KEY : 3 ' UTR 

(B) LOCATION: 1394 . . 1655 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 80 . . 139 

(D) OTHER INFORMATION: /function= "membrane anchor (amino 
acids 10-29) " 

/product- "hydrophobic amino acid stretch in GnTI " 
/standard_name= "membrane anchor of a type II 
Golgi protein" 

/note= "identified by comparison with GnTI sequences 
from animals" 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 1. . 14 

(D) OTHER INFORMATION: /function- "used for cloning the 
cDNA library in Lambda ZAPII" 
/product- "EcoRI/Notl-cDNA adapter" 
/number^ 1 



(ix) FEATURE: 

(A) NAME/KEY: miscj eature 

(B) LOCATION: 165 6. . 1669 

(D) OTHER INFORMATION: /product- "EcoRI /Not I-cD^A adapter" 
/number- 2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GAATTCGCGG CCGCCTGAGA AACCCTCGAA TTCAATTTCG CATTTGGCAG AG ATG 55 

Met 
1 

AGA GGG AAC AAG TTT TGC TTT GAT TTA CGG TAC CTT CTC GTC GTG GCT 10 3 

Arg Gly Asn Lys Phe Cys Phe Asp Leu Arg Tyr Leu Leu Val Val Ala 
5 10 15 



GCT CTC GCC TTC ATC TAC ATA CAG ATG CGG CTT TTC GCG ACA CAG TCA 
Ala Leu Ala Phe lie Tyr lie Gin Met Arg Leu Phe Ala Thr Gin Ser 



151 
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20 25 30 

GAA TAT GTA GAC CGC CTT GCT GCT GCA ATT GAA GCA GAA AAT CAT TGT 199 
Glu Tyr Val Asp Arg Leu Ala Ala Ala He Glu Ala Glu Asn His Cys 
35 40 45 

ACA AGT CAG ACC AGA TTG CTT ATT GAC AAG ATT AGC CAG CAG CAA GGA 24 7 

Thr Ser Gin Thr Arg Leu Leu He Asp Lys He Ser Gin Gin Gin Gly 
50 55 60 65 

AGA GTA GTA GCT CTT GAA GAA CAA ATG AAG CAT CAG GAC CAG GAG TGC 2 95 

Arg Val Val Ala Leu Glu Glu Gin Met Lys His Gin Asp Gin Glu Cys 
70 75 80 

CGG CAA TTA AGG GCT CTT GTT CAG GAT CTT GAA AGT AAG GGC ATA AAA 34 3 

Arg Gin Leu Arg Ala Leu Val Gin Asp Leu Glu Ser Lys Gly He Lvs 
85 90 95 

AAG TTA ATC GGA GAT GTG CAG ATG CCA GTG GCA GCT GTA GTT GTT ATG 3 91 

Lys Leu He Gly Asp Val Gin Met Pro Val Ala Ala Val Val Val Met 
100 105 HO 

GCT TGC AGT CGT ACT GAC TAC CTG GAG AGG ACT ATT AAA TCC ATC TTA 4 39 

Ala Cys Ser Arg Thr Asp Tyr Leu Glu Arg Thr He Lys Ser He Leu 
115 120 125 

AAA TAC CAA ACA TCT GTT GCA TCA AAA TAT CCT CTT TTC ATA TCC CAG 4 87 

Lys Tyr Gin Thr Ser Val Ala Ser Lys Tyr Pro Leu Phe He Ser Gin 
130 135 140 145 

GAT GGA TCA AAT CCT GAT GTA AGA AAG CTT GCT TTG AGC TAT GGT CAG 5 35 

Asp Gly Ser Asn Pro Asp Val Arg Lys Leu Ala Leu Ser Tyr Gly Gin 
150 155 160 

CTG ACG TAT ATG CAG CAC TTG GAT TAT GAA CCT GTG CAT ACT GAA AGA 58 3 

Leu Thr Tyr Met Gin His Leu Asp Tyr Glu Pro Val His Thr Glu Arg 
165 170 175 

CCA GGG GAA CTG GTT GCA TAC TAC AAG ATT GCA CGT CAT TAC AAG TGG 631 
Pro Gly Glu Leu Val Ala Tyr Tyr Lys He Ala Arg His Tyr Lvs Trp 
180 185 190 

GCA TTG GAT CAG CTG TTT CAC AAG CAT AAT TTT AGC CGT GTT ATC ATA 67 9 

Ala Leu Asp Gin Leu Phe His Lys His Asn Phe Ser Arg Val He He 
195 200 205 

CTA GAA GAT GAT ATG GAA ATT GCT GCT GAT TTT TTT GAC TAT TTT GAG 72 7 

Leu Glu Asp Asp Met Glu He Ala Ala Asp Phe Phe Asp Tyr Phe Glu 
210 215 220 225 

GCT GGA GCT ACT CTT CTT GAC AGA GAC AAG TCG ATT ATG GCT ATT TCT 7 75 

Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser He Met Ala He Ser 
230 235 240 

TCT TGG AAT GAC AAT GGA CAA AGG CAG TTC GTC CAA GAT CCT GAT GCT 82 3 

Ser Trp Asn Asp Asn Gly Gin Arg Gin Phe Val Gin Asp Pro Asp Ala 
245 250 255 
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CTT TAC CGC TCA GAC TTT TTT CCT GGT CTT GGA TGG ATG CT" TCA AAA 
Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Ser iJJ 
260 265 270 

Ser Jhr Tro Ser cf ^ TGG CCA *™ GCT ™' ^ GAT 

Ser Thr Trp Ser Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp Asp 

280 285 

GAC TGG CTA AGG CTG AAA GAA AAT CAC AGA GGT CGA CAA TTT ATT CGC 
Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg £J HI S Arg 

300 3 0 5 

CCA GAA GTT TGC AGA ACG TAC AAT TTT GGT GAG CAT GGT TCT AGT TTG 
Pro Glu val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser leu 
310 315 320 

GGG CAG TTT TTT AAG CAG TAT CTT GAG CCA ATT AAG CTA AAT GAT err 
Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro He L^s Leu ™ Asp vll 
325 330 335 

CAG GTT GAT TGG AAG TCA ATG GAC CTA AGT TAC CTT TTG GAG GAC AAC 
Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu Glu Asp 
■ 3qu 345 35 0 

TAT GTG AAA CAC TTT GGC GAC TTG GTT AAA AAG GCT AAG CCC ATC CAC 
Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys Pro lie His 
s>3 ° 360 3 6 5 

GGA GCT GAT GCT GTT TTG AAA GCA TTT AAC ATA GAT GGT GAT GTC err 
Gly Ala Asp Ala Val Leu Lys Ala Phe Asn He Sp Gly Sp ?I? £J 
30 375 380 38 l 

ATT CAG TAC AGA GAC CAA CTA GAC TTT GAA GAT ATC GCT CGA CAG TTT 
He Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asp lie Ala Arg Gin III 
390 395 400 

GGC ATT TTT GAA GAA TGG AAG GAT GGT GTA CCA CGG GCA GCA TAT AAA 
Gly lie Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala Ala Tyr 
405 410 415 

GGG ATA GTA GTT TTC CGG TTT CAA ACA TCT AGA CGT GTG TTC CTT GTT 
Gly lie Val Val Phe Arg Phe Gin Thr Ser Arg Arg Val He Leu Val 
420 425 43 0 

TCC CCT GAT TCT CTT CGA CAA CTT GGA GTT GAA GAT ACT TAG 
Ser Pro Asp Ser Leu Arg Gin Leu Gly Val Glu Asp Thr * 
435 4 40 44 £ 

CGAAGATATG ATTGGAGCCT GAGCAACAAT TTAGACTTAT TTGGTAGGAT ACATTTGAAA 
GAGCTGACAC GAAAAGTATG ACTACCAGTA GCTACATGCA ACATTTTAAT GTTAATGGAA 
GGAACCCACT GCTTATTGTT GGAATGGATG AATCATCACC ACATCCTATT A'.rTCAAGTTT 
ACAAACATAA AGAGGAAATG TTGCCCTATA AAAACAAATT TTTTGTTTCT AAGAAGGAAC 
GTTACGATTA TGAGCAACTT TGGCGGCCGC GAATTC 



871 

919 

967 

1015 

1063 

1111 

1159 

1207 

1255 

1303 

1351 

1393 

1453 
1513 
1573 
1633 
1669 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Gly Asn Lys Phe Cys Phe Asp Leu Arg Tyr Leu Leu Val Val 
15 10 15 

Ala Ala Leu Ala Phe lie Tyr lie Gin Met Arg Leu Phe Ala Thr Gin 
20 25 30 

Ser Glu Tyr Val Asp Arg Leu Ala Ala Ala lie Glu Ala Glu Asn His 
35 40 45 

Cys Thr Ser Gin Thr Arg Leu Leu lie Asp Lys lie Ser Gin Gin Gin 
50 55 60 

Gly Arg Val Val Ala Leu Glu Glu Gin Met Lys His Gin Asp Gin Glu 
65 70 75 80 

Cys Arg Gin Leu Arg Ala Leu Val Gin Asp Leu Glu Ser Lys Gly lie 
85 90 95 

Lys Lys Leu lie Gly Asp Val Gin Met Pro Val Ala Ala Val Val Val 
100 105 110 

Met Ala Cys Ser Arg Thr Asp Tyr Leu Glu Arg Thr lie Lys Ser lie 
115 120 125 

Leu Lys Tyr Gin Thr Ser Val Ala Ser Lys Tyr Pro Leu Phe He Ser 
130 135 140 

Gin Asp Gly Ser Asn Pro Asp Val Arg Lys Leu Ala Leu Ser Tyr Gly 
145 150 155 160 

Gin Leu Thr Tyr Met Gin His Leu Asp Tyr Glu Pro Val His Thr Glu 
165 170 :.75 

Arg Pro Gly Glu Leu Val Ala Tyr Tyr Lys He Ala Arg His Tyr Lys 
180 185 190 

Trp Ala Leu Asp Gin Leu Phe His Lys His Asn Phe Ser Arg Val He 
195 200 205 

He Leu Glu Asp Asp Met Glu He Ala Ala Asp Phe Phe Asp Tyr Phe 
210 215 220 

Glu Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser He Met Ala He 
225 230 235 240 

Ser Ser Trp Asn Asp Asn Gly Gin Arg Gin Phe Val Gin Asp Pro Asp 
245 250 255 

Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Ser 
260 265 270 
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Lys Ser Thr Trp Ser Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp 
275 280 285 

Asp Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg Gin Phe He 
290 295 300 

Arg Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser 
305 310 

Leu Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro He Lys Leu Asn Asp 
325 330 JJD 

Val Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu Glu Asp 
340 345 ^ u 

Asn Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys Pro He 
355 360 365 

His Gly Ala Asp Ala Val Leu Lys Ala Phe Asn lie Asp Gly Asp Val 
370 375 380 

Arg lie Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asp He Ala Arg Gin 
385 390 395 

Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala Ala Tyr 
405 410 tlD 

Lys Gly He Val Val Phe Arg Phe Gin Thr Ser Arg Arg Val Phe Leu 



420 



Val Ser Pro Asp Ser Leu Arg Gin Leu Gly Val Glu Asp Thr * 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Nicotiana tabacum 

(B) STRAIN: Samsun NN 

(D) DEVELOPMENTAL STAGE: Sink organ 

(F) TISSUE TYPE: Mesophyll 

(G) CELL TYPE: Leaf cells 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Lambda ZAP II (Eco RI ) 

(B) CLONE: gntI-A9(T) 
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(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 7 33. .741 

(D) OTHER INFORMATION: /function^ "Asn codon ;.n this 
context is a potential glycosylat ion ;>ite" 
/product^ "N-glycosylation consensus sequence" 
/phenotype= "N-glycans modulate prote;.n 
properties " 

/standard_name= "N-glycosylation site" 
/label= pot-CHO 

/note= "GnTI sequences from animals do not contain this 
feature" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 127 . . 1467 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /codon_start= 127 

/function= "initiates complex N-glycar.s on 
secretory glycoproteins" 
/EC_number= 2.4.1.101 
/product= 

"beta-1, 2-N-acetylglucosaminyltransf erase I" 
/evidence^ EXPERIMENTAL 
/gene= "cgl" 
/standard_name= "gnt I " 
/label= ORF 

/note= "first gntl sequence from tobacco (unpublished)" 

(ix) FEATURE: 

(A) NAME /KEY : 5 1 UTR 

(B) LOCATION: 15. .126 

(ix) FEATURE: 

(A) NAME /KEY : 3 1 UTR 

(B) LOCATION: 1468 . . 1723 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 154 . .213 

(D) OTHER INFORMATION :/function= "membrane anchor (amino 
acids 10-29) " 

/product^ "hydrophobic amino acid stretch in GnTI" 
/standard_name= "membrane anchor of a type II 
golgi protein" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1 . . 14 

(D) OTHER INFORMATION: /function^ "use for cloning the 
cDNA library in Lambda ZAPII" 
/product= "EcoRI/Notl-cDNA adapter" 
/number^ 1 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1724 . . 1737 

(D) OTHER INFORMATION: /product= "EcoRI /Not I-cDNA adapter" 
/number= 2 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCGCGG CCGCCATTGA CTTGATCCTA ACTGAACAGG CAAAGTAAAT CCAGCGATGA 60 

AACACTCATA ACTGAACACT GAGAGACTAT TCGCTTTCTC CTAAAGCCTT CAATCGAATT 120 

CGCACG ATG AGA GGG AAC AAG TTT TGC TGT GAT TTC CGG TAC CTC CTC 168 
Met Arg Gly Asn Lys Phe Cys Cys Asp Phe Arg Tyr Leu Leu 
450 455 460 

ATC TTG GCT GCT GTC GCC TTC ATC TAC ACA CAG ATG CGG CTT TTT GCG 216 
lie Leu Ala Ala Val Ala Phe lie Tyr Thr Gin Met Arg Leu Phe Ala 
465 470 475 

ACA CAG TCA GAA TAT GCA GAT CGC CTT GCT GCT GCA ATT GAA GCA GAA 2 64 

Thr Gin Ser Glu Tyr Ala Asp Arg Leu Ala Ala Ala lie Glu Ala Glu 
480 485 490 

AAT CAT TGT ACA AGC CAG ACC AGA TTG CTT ATT GAC CAG ATT AGC CTG 312 
Asn His Cys Thr Ser Gin Thr Arg Leu Leu lie Asp Gin lie Ser Leu 
495 500 505 

CAG CAA GGA AGA ATA GTT GCT CTT GAA GAA CAA ATG AAG CGT CAG GAC 3 60 

Gin Gin Gly Arg lie Val Ala Leu Glu Glu Gin Met Lys Arg Gin Asp 
510 515 520 525 

CAG GAG TGC CGA CAA TTA AGG GCT CTT GTT CAG GAT CTT GAA AGT AAG 4 08 

Gin Glu Cys Arg Gin Leu Arg Ala Leu Val Gin Asp Leu Glu Ser Lys 
530 535 540 

GGC ATA AAA AAG TTG ATC GGA AAT GTA CAG ATG CCA GTG GCT GCT GTA 4 56 

Gly lie Lys Lys Leu lie Gly Asn Val Gin Met Pro Val Ala Ala Val 
545 550 555 

GTT GTT ATG GCT TGC AAT CGG GCT GAT TAC CTG GAA AAG ACT ATT AAA 504 
Val Val Met Ala Cys Asn Arg Ala Asp Tyr Leu Glu Lys Thr "le Lys 
560 565 570 

TCC ATC TTA AAA TAC CAA ATA TCT GTT GCG TCA AAA TAT CCT CTT TTC 5 52 

Ser lie Leu Lys Tyr Gin lie Ser Val Ala Ser Lys Tyr Pro Leu Phe 
575 580 585 

ATA TCC CAG GAT GGA TCA CAT CCT GAT GTC AGG AAG CTT GCT TTG AGC 600 
lie Ser Gin Asp Gly Ser His Pro Asp Val Arg Lys Leu Ala Leu Ser 
590 595 600 605 

TAT GAT CAG CTG ACG TAT ATG CAG CAC TTG GAT TTT GAA CCT GTG CAT 64 8 

Tyr Asp Gin Leu Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val His 
610 615 620 

ACT GAA AGA CCA GGG GAG CTG ATT GCA TAC TAC AAA ATT GCA CGT CAT 69 6 

Thr Glu Arg Pro Gly Glu Leu lie Ala Tyr Tyr Lys lie Ala Arg His 
625 630 635 

TAC AAG TGG GCA TTG GAT CAG CTG TTT TAC AAG CAT AAT TTT AGC CGT 74 4 

Tyr Lys Trp Ala Leu Asp Gin Leu Phe Tyr Lys His Asn Phe Ser Arg 
640 645 650 

GTT ATC ATA CTA GAA GAT GAT ATG GAA ATT GCC CCT GAT TTT TTT GAC 7 92 

Val lie lie Leu Glu Asp Asp Met Glu lie Ala Pro Asp Phe Phe Asp 
655 660 665 



TTT TTT GAG GCT GGA GCT ACT CTT CTT GAC AGA GAC AAG TCG ATT ATG 



840 
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984 



Phe Phe Glu Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser lie Met 
670 675 680 685 

GCT ATT TCT TCT TGG AAT GAC AAT GGA CAA ATG CAG TTT GTC CAA GAT 888 
Ala lie Ser Ser Trp Asn Asp Asn Gly Gin Met Gin Phe Val Gin As£ 
690 695 700 

CCT TAT GCT CTT TAC CGC TCA GAT TTT TTT CCC GGT CTT GGA TGG ATG 936 
Pro Tyr Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met 
705 710 715 

CTT TCA AAA TCT ACT TGG GAC GAA TTA TCT CCA AAG TGG CCA AAG GCT 
Leu Ser Lys Ser Thr Trp Asp Glu Leu Ser Pro Lys Trp Pro Lys Ala 
720 725 730 

TAC TGG GAC GAC TGG CTA AGA CTC AAA GAG AAT CAC AGA GGT CGA CAA 1032 
Tyr Trp Asp Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg Gin 
'35 740 745 * 

TTT ATT CGC CCA GAA GTT TGC AGA ACA TAT AAT TTT GGT GAG CAT GGT 1080 
Phe He Arg Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly 
/b0 755 760 7 6 5 

TCT AGT TTG GGG CAG TTT TTC AAG CAG TAT CTT GAG CCA ATT AAA CTA 1128 
Ser Ser Leu Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro lie Lys leu 
770 775 780 

AAT GAT GTC CAG GTT GAT TGG AAG TCA ATG GAC CTT AGT TAC CTT TTG 117 6 

Asn Asp Val Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu 
785 790 795 

GAG GAC AAT TAC GTG AAA CAC TTT GGT GAC TTG GTT AAA AAG GCT AAG 1224 
Glu Asp Asn Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys 
800 805 8 io 

CCC ATC CAT GGA GCT GAT GCT GTC TTG AAA GCA TTT AAC ATA GAT GGT 1272 
Pro lie H 1S Gly Ala Asp Ala Val Leu Lys Ala Phe Asn He Asp Gly 
815 820 825 

GAT GTG CGT ATT CAG TAC AGA GAT CAA CTA GAC TTT GAA AAT ATC GCA 1320 
Asp Val Arg lie Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asn lie Alt 
830 835 840 845 

CGG CAA TTT GGC ATT TTT GAA GAA TGG AAG GAT GGT GTA CCA CGT GCA 13 68 

Arg Gin Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala 
850 855 3 6 o 

GCA TAT AAA GGA ATA GTA GTT TTC CGG TAC CAA ACG TCC AGA CGT GTA 1416 
Ala Tyr Lys Gly He Val Val Phe Arg Tyr Gin Thr Ser Arg Arg Val 
865 870 875 

III III vll r? C S AT ^ AT I CG CTT CAA CAA CTC GGA A TT GAA GAT ACT 14 64 

Phe Leu Val Gly His Asp Ser Leu Gin Gin Leu Gly He Glu Asp Thr 
880 885 890 

TAA CAAAGATATG ATTGCAGGAG CCCGGGCAAA ATTTTTGACT TATTGGGTAG 1517 

GATGCATCGA GCTGACACTA AACCATGATT TTACCAGTTA CATACAACGT TTTAATGTTA 1577 
TAC G GAG GAG CTCACTGTTC TAGTGTTGAA GGGATATCGG CTTCTTAGTA TTGGATGAAT 1637 
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CATCAACACA ACCTATTATT TTAAGTGTTC AGAACATAAA GAGGAAATGT AGCCCTGTAA 1697 
AGACTATACA TGGGACCATC ATAATCGCGG CCGCGAATTC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Arg Gly Asn Lys Phe Cys Cys Asp Phe Arg Tyr Leu Leu He Leu 

10 15 

Ala Ala val Ala Phe He Tyr Thr Gin Met Arg Leu Phe Ala Thr Gin 

25 30 

Ser Glu Tyr Ala Asp Arg Leu Ala Ala Ala He Glu Ala Glu Asn His 

40 45 

Cys Thr Ser Gin Thr Arg Leu Leu He Asp Gin lie Ser Leu Gin Gin 

Gly Arg He Val Ala Leu Glu Glu Gin Met Lys Arg Gin Asp Gin Glu 

75 80 
Cys Arg Gin Leu Arg Ala Leu Val Gin Asp Leu Glu Ser Lys Gly lie 



85 9 0 95 



Lys Lys Leu lie Gly Asn Val Gin Met Pro Val Ala Ala Val Val Val 

105 110 

Met Ala Cys Asn Arg Ala Asp Tyr Leu Glu Lys Thr He Lys Ser lie 

120 125 

Leu Lys Tyr Gin He Ser Val Ala Ser Lys Tyr Pro Leu Phe He Ser 

Gin Asp Gly Ser His Pro Asp Val Arg Lys Leu Ala Leu Ser Tyr Asp 

155 160 
Gin Leu Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val His Thr Glu 

165 170 175 

Arg Pro Gly Glu Leu He Ala Tyr Tyr Lys He Ala Arg His Tyr Lys 



190 



Trp Ala Leu Asp Gin Leu Phe Tyr Lys His Asn Phe Ser Arg Val He 

200 205 

He Leu Glu Asp Asp Met Glu He Ala Pro Asp Phe Phe Asp Phe Phe 

* 1;> 220 
Glu Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser He Met Ala He 

230 235 240 

Ser Ser Trp Asn Asp Asn Gly Gin Met Gin Phe Val Gin Asp Pro Tyr 
245 250 P 255 Y 



1737 
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A1 a Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Ser 

260 Zb ^ 
Lys ser Thr Trp Asp Glu Leu ser Pro Ly 5 Trp Pro Ly, Ala Tyr Trp 

A,p Asp Trp Leu Arc, Leu Lys »lu «o Bi. Ar, |ly Ar g Gin Phe He 

290 

Arg Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser 

305 310 

Le u Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro He Lys Leu Asn Asp 

Val Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu Glu Asp 

340 34b 
A sn Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys Pro He 

355 360 
His Gly Ala Asp Ala Val Leu Lys Ala Phe Asn lie Asp Gly Asp Val 
370 

Arg lie Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asn He Ala Arg Gin 

385 390 

Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala Ala Tyr 

405 410 
L ys Gly He val Val Phe Arg Tyr Gin Thr Ser Arg Arg Val Phe Leu 

420 425 
Val Gly His Asp Ser Leu Gin Gin Leu Gly He Glu Asp Thr * 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1854 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: No 
(iv) ANTI-SENSE: No 

(vi) ORIGINAL SOURCE: _ .. na 

(A) ORGANISM: Arabidopsis thalxana 

(B) STRAIN: Columbia 

D) DEVELOPMENTAL STAGE: Mature plants 
(F) TISSUE TYPE: All tissues 

(Vil) ^XJLS^aibda Uni-ZAP (EcoRI/XhoI) and 

Lambda ACT (Xhol) 
(B) CLONE : pBSK-Ara-Gnt I-f ull #8 

(ix) FEATURE: 
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(A) NAME /KEY : miscjeature 

(B) LOCATION: 1185. . 1193 

(D) OTHER INFORMATION: /function= "Asn Codon is a 
potential glycosylation site" 
/product= "Consensus sequence for 
N-glycosylation" 

/phenotype= "N glycans modulate 

protein characteristics" 
/standard_name= "N glycosylation site" 
/label= pot-CHO 

/note= "absent in animal GnTI sequences" 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 135. .14 69 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /codon_start= 135 

/function^ "initiates complex N glycans on 
secretory glycoproteins" 
/EC_number= 2.4.1.101 
/product^ 

"beta-1, 2-N-acetyl glucosaminyl transferase I" 

/evidence^ EXPERIMENTAL 

/gene= "cgl" 

/ s t andard_name= " gnt I " 

/label= ORF 

/note= "first gntl sequence from Arabidopsis 
(unpublished) " 

(ix) FEATURE: 

(A) NAME /KEY : 5 1 UTR 

(B) LOCATION: 19. . 134 

(ix) FEATURE: 

(A) NAME /KEY : 3 1 UTR 

(B) LOCATION: 14 70. . 1848 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 157 . .215 

(D) OTHER INFORMATION :/function= "membrane anchor 
(amino acids 8-27)" 

/product= "hydrophobic amino-acid region in 
GnTI" 

/standard_name= "membrane anchor of a "ype II 
Golgi protein" 

/note= "identified by comparison with animal GnTI 
sequenzes " 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: 1 . . 18 

(D) OTHER INFORMATION: /function^ "for preparation 
of a cDNA library in Lambda ACT" 
/product= "XhoI-cDNA-Adaptor" 
/number^ 1 

(ix) FEATURE: 

(A) NAME /KEY : miscjeature 

(B) LOCATION: 184 9. .18 54 

(D) OTHER INFORMATION: /product^ "XhoI-cDNA-Adciptor " 
/number= 2 



- 56 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTCGAGGCCA CGAAGGCCAC CGTTTTTGTT ATAACGAACG ACACCGTTTC AAACAACTTC 

CTTATTAGCT AGCTCCCTCC CGGCGGCAAA CACCAGAAGA TCCACCGCTT TTGATCTGGT 

TGTTTGTCGT CGAT ATG GCG AGG ATC TCG TGT GAC TTG AGA TTT CTT CTC 
Met Ala Arg He Ser Cys Asp Leu Arg Phe Leu Leu 
15 10 

ATC CCG GCA GCT TTC ATG TTC ATC TAC ATC CAG ATG AGG CTT TTC CAG 
He Pro Ala Ala Phe Met Phe He Tyr He Gin Met Arg Leu Phe Gin 
15 20 25 

ACG CAA TCA CAG TAT GCA GAT CGC CTC AGT TCC GCT ATC GAA TCT GAG 
Thr Gin Ser Gin Tyr Ala Asp Arg Leu Ser Ser Ala He Ser G^u 

JU 35 40 

AAC CAT TGC ACT AGT CAA ATG CGA GGC CTC ATA GAT GAA GTT AGC ATC 
Asn His Cys Thr Ser Gin Met Arg Gly Leu He Asp Glu Val Ser lie 
45 50 55 60 

AAA CAG TCG CGG ATT GTT GCC CTC GAA GAT ATG AAG AAC CGC CAG GAC 
Lys Gin Ser Arg lie Val Ala Leu Glu Asp Met Lys Asn Arg Gin 2p 
65 70 75 

GAA GAA CTT GTG CAG CTT AAG GAT CTA ATC CAG ACG TTT GAA AAA AAA 
Glu Glu Leu Val Gin Leu Lys Asp Leu He Gin Thr Phe Glu Lys^ Lys 
80 85 90 

GGA ATA GCA AAA CTC ACT CAA GGT GGA CAG ATG CCT GTG GCT GCT GTA 
Gly lie Ala Lys Leu Thr Gin Gly Gly Gin Met Pro Val Ala Ala Vat 
95 100 10 5 

vll vll mI? a?° p GC c GT CGT GCA GAC ™ CTT GAA AGG ACT GTT AAA 
Val val Met Ala Cys Ser Arg Ala Asp Tyr Leu Glu Arg Thr Val Lys 

llu 115 120 

TCA GTT TTA ACA TAT CAA ACT CCC GTT GCT TCA AAA TAT CCT CTA TTT 
Ser Val Leu Thr Tyr Gin Thr Pro Val Ala Ser Lys Tyr Pro Leu III 

130 135 140 

ATA TCT CAG GAT GGA TCT GAT CAA GCT GTC AAG AGC AAG TCA TTG AGC 
He Ser Gin Asp Gly Ser Asp Gin Ala Val Lys Ser Lys Ser leu Ser 
145 150 :L55 



60 
120 
170 

218 

266 

314 

362 

410 

458 

506 

554 

602 

650 

698 



746 



794 
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TAC TTT GAG GCT GCA GCT AGT CTC ATG GAT AGG GAT AAA AC- ATT ATC 
Tyr Phe Glu Ala Ala Ala Ser Leu Met Asp Arg Asp i£ ?fc III 
225 2 3o 235 

GCT GCT TCA TCA TGG AAT GAT AAT GGA CAG AAG CAG TTT GTG CAT GAT 
Ala Ala Ser Ser Trp Asn Asp Asn Gly Gin Lys Gin Phe Val Sis Asp 
240 245 250 

CCC TAT GCG CTA TAC CGA TCA GAT TTT TTT CCT GGC CTT GGG TGG ATG 
Pro Tyr Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Glv £rp Met 
"3 260 265 

CTC AAG AGA TCG ACT TGG GAT GAG TTA TCA CCA AAG TGG CCA AAG GCT 
Leu Lys Arg Ser Thr Trp Asp Glu Leu Ser Pro Lys Trp Pro L^s All 

275 280 

TAC TGG GAT GAT TGG CTG AGA CTA AAG GAA AAC CAT AAA GGC CGC CAA 
Tyr Trp Asp Asp Trp Leu Arg Leu Lys Glu Asn His Lys Gly Arg Sn* 

290 295 300 

TTC ATT GCA CCG GAA GTC TGT AGA ACA TAC AAT TTT GGT GAA CAT GGC 
Phe lie Ala Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly gTu Ss Gly 
305 310 315 

TCT AGT TTG GGA CAG TTT TTC AGT CAG TAT CTG GAA CCT ATA AAG CTA 
Ser Ser Leu Gly Gin Phe Phe Ser Gin Tyr Leu Glu Pro He L^s "u 
320 325 33 0 

AAC GAT GTG ACG GTT GAC TGG AAA GCA AAG GAC CTG GGA TAC CTG ACA 
Asn Asp Val Thr Val Asp Trp Lys Ala Lys Asp Leu Gly Tyr Leu 5S 
• 3J;5 340 345 

GAG GGA AAC TAT ACC AAG TAC TTT TCT GGC TTA GTG AGA CAA CCA rra 
Glu Gly Asn Tyr Thr Lys Tyr Phe Ser Gly Leu* vll £rg* SJ £J £J 
J0U 355 3 6 o 

CCA ATT CAA GGT TCT GAC CTT GTC TTA AAG GCT CAA AAC ATA AAG GAT 
Pro lie Gin Gly Ser Asp Leu Val Leu Lys Ala Gin Jtn lH L^s Sp 

30 375 330 

GAT GAT CGT ATC CGG TAT AAA GAC CAA GTA GAG TTT GAA CGC ATT GCA 
Asp Asp Arg He Arg Tyr Lys Asp Gin Val Glu Phe Glu Arg III £a 
385 390 395 

GGG GAA TTT GGT ATA TTT GAA GAA TGG AAG GAT GGT GTG CCA CGA ACA 
Gly Glu Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val PrJ Arg T*hr* 
400 405 410 

GCA TAT AAA GGA GTA GTG GTG TTT CGA ATC CAG ACA ACA AGA CGT GTA 
Ala Tyr Lys Gly Val Val Val Phe Arg He Gin Thr ?S Arg Arg Val 
15 420 425 

TTC CTG GTT GGG CCA GAT TCT GTA ATG CAG CTT GGA ATT CGA ^AT TCC 
Phe Leu Val Gly Pro Asp Ser Val Met Gin Leu Gly ill £g ^ llr 

4 35 440 

TGA TGCAAAACAT ATGAAAGGAA AAGAAGATTT TGGACCGCAT GCAGCCTCCT 
445 

TCTAGCAGCT GTTAGGTTGT ATTGTTATTT ATGGATGAGT TTGTAGAGCG G^GGGGTTAA 
CTTTAACAGC AAGGAAGCTC TGGTGACCAG GCTGATTGGC TTAGAAGTTA TGGGAACCCC 



842 

890 

938 

986 

1034 

1082 

1130 

1178 

1226 

1274 

1322 

1370 

1418 

1466 

1519 

1579 
1639 
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TTGAAAGGGT CAGGGTTAAA TATATTTCAG TTGTTTTATT AGTGATTATC TTGTGGGTAA 
CTTATACGAA TGCAAATCAT TCTATGCAGT TTTTCTTCGT CCCACTTGTT TTGGCTTCTC 
TATTGCTAGT GTACATATCT CTTCAAACAT GTACTAAATA ATGCGTGTTG CTTCAAAGAA 
GTAACTTTTA TTAAAAAAAA AAAAAAAAAC TCGAG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 445 amino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Ala Arg lie Ser Cys Asp Leu Arg Phe Leu Leu He Pro Ala Ala 

5 10 15 

Phe Met Phe lie Tyr He Gin Met Arg Leu Phe Gin Thr Gin Ser Gin 

25 30 
Tyr Ala Asp Arg Leu Ser Ser Ala lie Glu Ser Glu Asn His Cyg ^ 

40 45 

Ser Gin Met Arg Gly Leu lie Asp Glu Val Ser He Lys Gin Ser Arg 

" 60 
lie Val Ala Leu Glu Asp Met Lys Asn Arg Gin Asp Glu Glu Leu Val 

75 80 
Gin Leu Lys Asp Leu He Gin Thr Phe Glu Lys Lys Gly He Ala Lys 

90 95 

Leu Thr Gin Gly Gly Gin Met Pro Val Ala Ala Val Val Val Met Ala 

105 110 

Cys Ser Arg Ala Asp Tyr Leu Glu Arg Thr Val Lys Ser Val Leu Thr 

120 125 

Tyr Gin Thr Pro Val Ala Ser Lys Tyr Pro Leu Phe He Ser Gin Asp 

135 14Q * 

Gly Ser Asp Gin Ala Val Lys Ser Lys Ser Leu Ser Tyr Asn Gin Leu 

155 160 
Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val Val Thr Glu Arg Pro 

Gly Glu Leu Thr Ala Tyr Tyr Lys lie Ala Arg His Tyr Lys Trp Ala 

Leu Asp Gin Leu Phe Tyr Lys His Lys Phe Ser Arg Val He He Leu 

ZOO 205 

Glu Asp Asp Met Glu He Ala Pro Asp Phe Phe Asp Tyr Phe Glu Ala 

220 

Ala Ala Ser Leu Met Asp Arg Asp Lys Thr lie Met Ala Ala Ser Ser 



1699 
1759 
1819 
1854 
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225 

Trp Asn 



Asp Asn Gly 
245 



230 

Gin Lys Gin Phe 



235 

Val His Asp Pro 
250 



240 

Tyr Ala Leu 
255 



Tyr Arg 
Thr Trp 



Trp Leu 
290 

Glu Val 
305 

Gin Phe 



Val Asp 
Thr Lys 



Ser Asp 
370 



Ser Asp Phe 
260 

Asp Glu Leu 
275 

Arg Leu Lys 



Cys Arg Thr 



Phe Ser Gin 
325 

Trp Lys Ala 
340 

Tyr Phe Ser 
355 

Leu Val Leu 



Phe Pro 

Ser Pro 

Glu Asn 
295 

Tyr Asn 
310 

Tyr Leu 
Lys Asp 
Gly Leu 



Lys Ala 
375 



Gly Leu 
265 

Lys Trp 
280 

His Lys 
Phe Gly 
Glu Pro 



Leu Gly 
345 

Val Arg 
360 



Gly Trp Met Leu 



Pro Lys Ala Tyr 
285 

Gly Arg Gin Phe 
300 

Glu His Gly Ser 
315 

lie Lys Leu Asn 

330 

Tyr Leu Thr Glu 



Gin Ala Arg Pro 
365 



Gin Asn lie Lys Asp Asp 
380 



Lys A.rg Ser 
270 

Trp A.sp Asp 



lie A.la Pro 

Ser Leu Gly 
320 

Asp Val Thr 
335 

Gly Asn Tyr 
350 

He Gin Gly 
Asp Arg He 



Arg Tyr 
385 

He Phe 



Lys Asp Gin 



Glu Glu Trp 
405 



Val Glu Phe Glu Arg He Ala Gly 
390 395 



Lys Asp Gly Val 



Pro Arg Thr Ala 
410 



Glu Phe Gly 
400 

Tyr Lys Gly 
415 



Val Val 



Val Phe Arg 
420 



He Gin 



Thr Thr 
425 



Arg Arg Val Phe 



Leu Val Gly 
430 



Pro Asp 



Ser Val Met 
435 



Gin Leu 



Gly He 
440 



Arg Asn Ser 



445 
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CLAIMS 

Method for the production of glycoproteins displaying 
minimal, uniform and defined sugar residues, comprising 
cultivating a transgenic plant, parts of transgenic 
plants or transformed plant cells, and isolating the 
desired glycoprotein from the material cultivated, 
characterized in that the transgenic plant, parts of 
transgenic plants or transformed plant cells, 
respectively, is/are transformed with an antisense 
construct or a sense construct, comprising an antisense 
DNA or a sense DNA with respect to the DNA sequence for 
a gene or a cDNA for plant N-acetyl glucosaminyl 
transferase I or a part thereof, for elimination or 
reduction of the activity of said N-acetyl glucosaminyl 
transferase, wherein the antisense or sense construct 
optionally contains additional regulatory sequences for 
the transcription of the respective antisense or sense 
DNA. 

Method according to claim 1, characterized in that for 
transformation an antisense or sense construct with 
respect to one of the cDNAs encoding N-acetyl gluco- 
saminyl transferase I from Solanum tuberosum, Nicotiana 
tabacum or Arabldopsis thaliana is used. 



r 



Method according to claim 2, characterized in that fo 
transformation an antisense or sense construct with 
respect to one of the DNA sequences given in SEQ ID NO: 
1, 3 or 5 is used. 

Method according to any of the claims 1 to 3, 
characterized in that the transgenic plant used is 
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additionally transformed with the gene encoding the 
desired glycoprotein. 

5. DNA, characterized in that it encodes N-acetyl 
glucosaminyl transferase I from Solanum tuberosum. 

6. DNA according to claim 5, characterized in that it 
comprises the nucleotide sequence given in SEQ ID NO: 1 
or a part thereof. 

7. DNA, characterized in that it encodes N-acetyl 
glucosaminyl transferase I from Nicotiana tabacum. 

8. DNA according to claim 7, characterized in that it 
comprises the nucleotide sequence given in SEQ ID NO: 3 
or a part thereof. 

9. DNA encoding N-acetyl glucosaminyl transferase I from 
Arabidopsis thaliana, characterized in that said DNA 
encodes the amino-acid sequence given in SEQ ID NO: 6 or 
the nucleotide sequence given in SEQ ID NO: 5 or a part 
thereof . 

DNA, characterized in that it comprises the nucleotide 
sequence complementary to the DNA according to claim 6 
8 or 9. 

11. DNA, characterized in that it may be obtained by 
substitution, deletion and/or insertion of one or more 
nucleotides and/or truncation at the 5' and/or 3' end of 
one of the DNAs according to any of the claims 5 to 10, 
with the proviso, that said DNA hybridizes at least in a 
partial region with the starting DNA or its 
complementary sequence or parts thereof under stringent 
conditions . 
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12. DNA, characterized in that it represents a gene or is 
part of a gene, which encodes the enzyme N-acetyl 
glucosaminyl transferase I, and which in its entirety or 
in a partial region thereof hybridizes under stringent 
conditions 

- to one of the DNA sequences or fragments according to 
any of the claims 5 to 11 and/or 

- to a DNA sequence, which has been derived from the 
amino acid sequences given in SEQ ID NO: 1, 3 and/or 5, 
considering the degeneration of the genetic code. 

13. DNA construct, 

characterized in that it comprises one or more of the 
DNAs according to any of the claims 5 to 14. 

14. DNA construct according to claim 13, 

characterized in that it comprises an antisense or sense 
DNA with respect to the DNA sequence according to any of 
the claims 5 to 12 and optionally regulatory sequences 
for the transcription of the antisense or sense DNA, 
respectively. 

15. Vector, plasmid, cosmid, virus or phage genome, 
characterized in that it contains at least a DNA and/or 
construct according to any of the claims 5 to 14. 

16. N-acetyl glucosaminyl transferase I from Solanum tubero- 
sum. 

17. N-acetyl glucosaminyl transferase I from Nlcotiana taba- 

cum . 

18. N-acetyl glucosaminyl transferase I from Arabidopsis 
thaliana , characterized in that the enzyme comprises the 
amino acid sequence set forth in SEQ ID NO: 6. 
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19. N-acetyl glucosaminyl transferase I, characterized in 
that the enzyme comprises the amino acid sequence set 
forth in SEQ ID NO: 2. 

20. N-acetyl glucosaminyl transferase I, characterized in 
that the enzyme comprises amino acids 7 4 to 44 6 of the 
amino acid sequence set forth in SEQ ID NO: 2. 

21. N-acetyl glucosaminyl transferase I,' characterized in 
that the enzyme comprises the amino acid sequence set 
forth in SEQ ID NO: 4. 



22 



23. 



24 



25 



26 



N-acetyl glucosaminyl transferase I, available due to 
hybridization of its gene or one or more of the portions 
of its gene to one or more of the DNAs and/or DNA 
fragments according to any of the claims 5 to 12. 

Enzymes or proteins derived from the enzymes according 
to any of the claims 16 to 22 by substitution, deletion, 
insertion and/or modification of individual amino acids 
and/or smaller groups of amino acids and/or by N- and/or 
C-terminal truncation and/or extension. 

Protein or peptide, comprising one or more portions of 
the amino acid sequence (s) of one or more of the enzymes 
defined in any of the claims 16 to 23. 

Protein or peptide, encoded by one of the DNAs according 
to any of the claims 5 to 12. 

Antigen, characterized in that it comprises: 

- the amino acid sequence given in SEQ ID NO: 2, SEQ ID 
NO: 4 or SEQ ID NO: 6, or 

- amino acids 74 to 446 of the amino acid sequence given 
in Fig. 2, or 

- an amino acid sequence derived from the amino add 
sequences given in SEQ ID NO: 2, 4 or 6 by substitution, 
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deletion, insertion and/or modification of individual 
amino acids and/or smaller groups of amino acids, or 
- one or more parts of said sequences, 

with the proviso, that upon immunization of a host with 
the antigen, said antigen may raise an immunological 
reaction, including the production of antibodies 
directed against the antigen. 

27. Monoclonal or polyclonal antibody, characterized in that 
it specifically recognizes and binds one or more of the 
enzymes or antigens according to any of the claims 16 to 
26. 



28 . Microorganism, 

characterized in that it is transformed by at least one 
of the nucleotide sequences selected from the DNAs, 
constructs, vectors, plasmids, cosmids, virus or phage 
genomes according to one or more of the claims 5 to 15. 



29. Transgenic plant, transgenic seed, transgenic repro- 
duction material, parts of transgenic plants or trans- 
formed plant cell, obtainable by integration of one or 
more DNA sequence (s) or construct (s) according to any of 
the claims 5 to 13 under the control of a promoter 
effective in plants, into the genome of a plant, or via 
infection by means of a virus containing one or more DNA 
sequence (s) or construct (s) according to any of the 
claims 5 to 13, for an extrachromosomal propagation and 
expression of the DNA sequence (s) or construct (s) in the 
plant tissue infected. 



30. Transgenic plant, transgenic seed, transgenic repro- 
duction material, parts of transgenic pla'nts or trans- 
formed plant cell with missing or reduced N-acetyl 
glucosaminyl transferase I activity, obtainable by inte- 
gration of one or more antisense or sense construct (s) 
according to claim 14 under the control of a promoter 
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effective in plants, into the genome of a plant, or by 
viral infection by means of a virus containing one or 
more antisense or sense construct (s) according to claim 
14, for an extrachromosomal propagation and 
transcription of the antisense construct (s) in the plant 
tissue infected. 
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SUMMARY OF THE INVENTION 

This invention relates to plant GntI sequences, in particular 
to plant nucleic acid sequences encoding the enzyme N-acetyl 
glucosaminyl transferase I (GnTI) , DNA sequences derived 
therefrom, including GntI antisense and sense constructs, and 
the translation products thereof, antibodies directed against 
said translation products, as well as the use of the sequence 
information for the production of transformed microorganisms 
and transgenic plants, including those having reduced or 
missing N-acetyl glucosaminyl transferase I activity. Such 
plants displaying reduced or lacking N-acetyl glucosaminyl 
transferase I activity are of great importance for the pro- 
duction of glycoproteins of specific constitution with 
respect to their sugar residues. 
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Figure 1 
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Figure 2 
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Figure 2 ( continued) 



GCT GGA GCT ACT CTT CTT GAC AGA GAC AAG TCG ATT ATG GCT ATT TCT 775 
Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser He Met Ala He Ser 
230 235 240 

Tf*T TGG AAT r,Af A AT GG A CAA AGG CAG TTC GTC CAA GAT CCT GAT GCT 823 
Ser Trp Asn Asp Asn Gly Gin Arg Gin Phe Val Gin Asp Pro Asp Ala 
245 250 255 

CTT TAC CGC TCA r,AC TTT T TT CCT GGT CTT GGA TGG ATG CTT TCA AAA 871 
Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Ser Lys 
260 265 270 

TCA ACT TGG TCC GAA CTA TCT CCA AAG TGG CCA AAG GCT TAC TCG GAT 919 
Ser Thr Trp Ser Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp Asp 
275 280 285 

GAC TGG CTA AGG CTG AAA GAA AAT CAC AGA GGT CGA CAA TTT ATT CGC 9 67 

Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg Gin Phe He Arg 
290 295 300 305 

CCA GAA GTT TGC AGA ACG TAC AAT TTT GGT GAG CAT GGT TCT MT TTC 1015 
Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser Leu 
310 315 320 

GGG CAG TTT TTT AAG CAG TAT CTT GAG CCA ATT AAG CTA AAT GAT GTC 1063 
Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro He Lys Leu Asn Asp Val 
325 330 335 

CAG GTT GAT TGG AAG TCA ATG GAC CTA AGT TAC CTT TTG GAG GAC AAC 1111 
Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu Glu Asp Asn 
340 345 350 

TAT GTG AAA CAC TTT GGC GAC TTG GTT AAA AAG GCT AAG CCC ATC CAC 1159 
Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys Pro He His 
355 360 365 

GGA GCT GAT GCT GTT TTG AAA GCA TTT AAC ATA GAT GGT GAT GTG CGT 1207 
Glv Ala Asp Ala Val Leu Lys Ala Phe Asn He Asp Gly Asp Val Arg 
370 375 380 385 

ATT CAG TAC AGA GAC CAA CTA GAC TTT GAA GAT ATC GCT CGA CAG TTT 1255 
He Gin TVr Arg Asp Gin Leu Asp Phe Glu Asp He Ala Arg Gin Phe 
390 395 <L00 

GGC ATT TTT GAA GAA TGG AAG GAT GGT GTA CCA CGG GCA GCA TAT AAA 13 03 

Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala Ala Tyr Lys. 
405 410 415 

GGG ATA GTA GTT TTC CGG TTT CAA AC A TCT AGA CGT GTG TTC CTT GTT 1351 
Gly He Val Val Phe Arg Phe Gin Thr Ser Arg Arg Val Phe Leu Val 
420 425 430 

TCC CCT GAT TCT CTT CGA CAA CTT GGA GTT GAA GAT ACT TAG 1393 
Ser Pro Asp Ser Leu Arg Gin Leu Gly Val Glu Asp Thr End 
435 440 445 

CGAAGATATC ATTGGAGCCT GAGCAACAAT TTAGACTTAT TTGGT AGG AT ACATTTGAAA 1453 

GAGCTCACAC GAAAAGTATC ACTACCAGTA GCTACATGCA AC ATTTT AAT GTTAATGGAA 1513 

GGAACCCACT GCTTATTGTT GGAATGGATG AATCATCACC ACATCCTATT ATTCAAGTTT 1573 

ACAAACAT;^ AGAGGAAATG TTGCCCTATA AAAACAAATT TTTTCTTTCT A^GAAGGAAC 1633 

GTTACGATTA TCAGCAACTT TGGCGGCCGC GAATTC 1669 
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Figure 3B 
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Figure 4 
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Claims (under Art. 34 PCT) 

Method for the production of glycoproteins displaying 
minimal, uniform GlcNac 2 Man 5 -residues, comprising culti- 
vating a transgenic plant, parts of transgenic plants or 
transformed plant cells, and isolating the desired gly- 
coprotein from the material cultivated, characterized in 
that the transgenic plant, parts of transgenic plants or 
transformed plant cells, respectively, is/are trans- 
formed with an antisense construct or a sense construct, 
comprising an antisense DNA or a sense DNA with respect 
to the DNA sequence for a gene or a cDNA for plant N- 
acetyl glucosaminyl transferase I or a part thereof, for 
elimination or reduction of the activity of said N- 
acetyl glucosaminyl transferase, wherein the antisense 
or sense construct optionally contains additional regu- 
latory sequences for the transcription of the respective 
antisense or sense DNA. 

Method according to claim 1, characterized in that for 
transformation an antisense or sense construct with re- 
spect to one of the cDNAs encoding N-acetyl glucosaminyl 
transferase I from Solanum tuberosum, Nicotiana tabacum 
or Arabidopsis thaliana is used. 

Method according to claim 2, characterized in that for 
transformation an antisense or sense construct with re- 
spect to one of the DNA sequences given in SEQ ID NO: 1, 
3 or 5 is used. 

Method according to any of the claims 1 to 3, character- 
ized in that the transgenic plant used is additionally 
transformed with the gene encoding the desired glycopro- 
tein. 



DNA, characterized in that it encodes N-acetyl glucosa- 
iainyl transferase I from Solanum tuberosum and comprises 
the nucleotide sequence given in SEQ ID NO: 1. 

DNA, characterized in that it encodes N-acetyl glucosa- 
minyl transferase I from Nicotians tabacum and comprises 
the nucleotide sequence given in SEQ ID NO: 3. 

DNA encoding N-acetyl glucosaminyl transferase I from 
Arabidopsis thaliana, characterized in that said DNA en- 
codes the amino-acid sequence given in SEQ ID NO: 6 or 
the nucleotide sequence given in SEQ ID NO: 5. 

DNA, characterized in that it comprises the nucleotide 
sequence complementary to the DNA according to claim 5, 
6 or 7. 

DNA, characterized in that it may be obtained by substi- 
tution, deletion and/or insertion of one or more nucleo- 
tides and/or truncation at the 5' and/or 3' end of one 
of the DNAs according to any of the claims 5 to 8, with 
the proviso, that said DNA hybridizes with the starting 
DNA or its complementary sequence under stringent condi- 
tions • 

DNA, characterized in that it represents a gene or is 
part of a gene, which encodes the enzyme N-acetyl gluco- 
saminyl transferase I, and which hybridizes under strin- 
gent conditions 

- to one of the DNA sequences according to any of the 
claims 5 to 9 and/or 

- to a DNA sequence, which has been derived from the 
amino acid sequences given in SEQ ID NO: 1, 3 and/or 5, 
considering the degeneration of the genetic code. 
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20. Antigen, characterized in that it comprises: 

the amino acid sequence given in SEQ ID NO: 2, SEQ ID 
NO: 4 or SEQ ID NO: 6, or 

amino acids 74 to 446 of the amino acid sequence 
given in Fig. 2, or 

an amino acid sequence derived from the amino acid 
sequences given in SEQ ID NO: 2, 4 or 6 by substitu- 
tion, deletion, insertion and/or modification of in- 
dividual amino acids and/or smaller groups of amino 
acids, 

with the proviso, that upon immunization of a host with 
the antigen, said antigen may raise an immunological re- 
action, including the production of antibodies directed 
against the antigen. 

21. Monoclonal or polyclonal antibody, characterized in that 
it specifically recognizes and binds one or more of the 
enzymes or antigens according to any of the claims 14 to 
20. 

22. Microorganism, characterized in that it is transformed 
by at least one of the nucleotide sequences selected 
from the DNAs, constructs, vectors, plasmids, cosmids, 
virus or phage genomes according to one or more of the 
claims 5 to 13. 

23. Transgenic plant, transgenic seed, transgenic reproduc- 
tion material, parts of transgenic plants or transformed 
plant cell with missing or reduced N-acetyl glucosaminyl 
transferase I activity, obtainable by integration of one 
or more antisense or sense construct (s) according to any 
of claims 5 to 13 under the control of a promoter effec- 
tive in plants, into the genome of a plant, or by viral 
infection by means of a virus containing one or more an- 
tisense or sense construct (s) according to any of claims 
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5 to 13, for an extrachromosomal propagation and tran- 
scription of the antisense construct (s) in the plant 
tissue infected. 

24. Transgenic plant, transgenic seed, transgenic reproduc- 
tion material, parts of transgenic plants or transformed 
plant cell with missing or reduced N-acetyl glucosaminyl 
transferase I activity, obtainable by integration of one 
or more antisense or sense construct (s) according to 
claim 12 under the control of a promoter effective in 
plants, into the genome of a plant, or by viral infec- 
tion by means of a virus containing one or more an- 
tisense or sense construct (s) according to claim 12, for 
an extrachromosomal propagation and transcription of the 
antisense construct (s) in the plant tissue infected* 



